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Preface 



This is the second edition of my book Galaxy Formation. Many people liked the first 
edition which appeared in 1998, just before the explosion of magnificent new data 
which have completely changed the face of astrophysical cosmology. Many of the 
agonies which had to be gone through in the first edition have disappeared and, to 
many people’s amazement, including mine, there is now a concordance model for 
cosmology, the cosmologist’s equivalent of the particle physicist’s standard model. 
Just like the standard model, however, the concordance model creates as many 
problems as it solves. This is not a cause for concern, but rather one for celebration 
because we are now able to ask much better and deeper questions than in the past. 
These questions indicate clearly the need for physics and astrophysics ‘Beyond the 
Concordance Model’. 

The object of this new edition is to bring this amazing story up-to-date, very much 
in the spirit of the first edition. To recapitulate some of the points made in the previous 
preface about the origin of the book, I was asked by Springer- Verlag to expand the set 
of lecture notes that I prepared in 1988 for the First Astrophysics School organised 
by the European Astrophysics Doctoral Network into a full-length book. The set of 
notes was entitled Galaxy Formation and was published as a chapter of the volume 
Evolution of Galaxies: Astronomical Observations (eds. I. Appenzeller, H.J. Habing 
and P. Lena, pages 1 to 93, Springer- Verlag Berlin, Heidelberg, 1989). In that chapter, 
I attempted to bridge the gap between elementary cosmology and the technical papers 
appearing in the literature which can seem quite daunting on first encounter. The 
objective was to present the physical ideas and key results as clearly as possible as 
an introduction and guide to the technical literature. 

In 1993, more lecture notes on The Physics of Background Radiation were 
prepared for the 23rd Advanced Course of the Swiss Society of Astrophysics 
and Astronomy, the topic being The Deep Universe (A.R. Sandage, R.G. Kron 
and M.S. Longair, Springer- Verlag Berlin, Heidelberg, 1995). Then, also in 1993, 
I completed a history of twentieth century astrophysics and cosmology, which was 
published as Chap. 23 of a three-volume work entitled Twentieth Century Physics 
(eds. L.M. Brown, A. Pais and A.B. Pippard, IOP Publications, AIP Press Bris- 
tol, and New York 1995). A much enlarged full-length book on this topic entitled 
The Cosmic Century: A History of Astrophysics and Cosmology was published by 
Cambridge University Press in 2006. That book brought the story of the origin of 
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galaxies and the large-scale structure of the Universe up-to-date as of October 2005 
and it has been further updated and expanded in the present book. Just as in the 
first edition, the present volume is much more than a recycled and concatenated 
version of previously published works. I have rewritten and rethought the original 
versions, expanded some parts, brought everything up-to-date and included new 
material. 

I often find that I understand things best, and present them most clearly, when 
I have to prepare them for students, at either the undergraduate or the post-graduate 
level, and so 1 have adopted the same form of presentation here. I have intentionally 
presented the material in an informal, pedagogical manner, and attempted to avoid 
getting bogged down in formalities and technicalities. If the material becomes too 
difficult, I simply summarise the key points, give some appropriate references and 
pass on. My approach is to reduce the problems to their simplest form and ratio- 
nalise from these examples the results of more complete analyses. Wherever it is 
feasible without excessive effort, we will attempt to derive exact results. The level 
of presentation is intended to be appropriate for a final-year undergraduate or first- 
year post-graduate course of lectures. In other words, it is assumed that the reader 
has a good grasp of basic physics but does not necessarily have the appropriate 
background in astronomy, astrophysics or cosmology. My aim has been to write 
a user-friendly book, taking particular care to expound carefully areas where I have 
found students have difficulty. 

When I wrote the original set of lecture notes on galaxy formation, my objective 
was to tell the story of modern astrophysical cosmology from the perspective of 
one of its most important and fundamental problems of cosmology - how did the 
galaxies come about? I enjoy this approach to the exposition of modern cosmology 
because, to do the problem justice, it is essential to introduce the whole of what 
I call classical cosmology, as the framework for the discussion. This approach has, 
for me, the great advantage of concentrating upon a crucial problem of astrophysical 
cosmology rather than regarding the objective of cosmology as being simply the 
delineation of a preferred cosmological model, however interesting that is in its 
own right. As we will show, the origin of galaxies and larger-scale structures in the 
Universe is one of the great cosmological problems and has provided us with unique 
and direct information about the physics of the very early Universe. 

This new understanding brings with it the question of whether or not the old 
structure of the book is really appropriate - do we really need to grind through all 
the old story in order to understand the problems raised by the concordance model? 
My decision has been to maintain much of the original structure of the book, largely 
because the approach was very strongly physics-motivated and the old story reveals 
much of the essential physics of the concordance model. 

One final warning is in order. I make no claim that this presentation is com- 
plete, unbiased or objective. You should regard the book as my own impressions 
and opinions of what I consider to be the important issues of modern astrophysi- 
cal cosmology. Others would tell the story in a completely different way and put 
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emphasis upon different parts of what is unquestionably a multi-dimensional story. 
1 will endeavour to include as wide a spectrum of ideas and opinions as possible, 
but the text will inevitably be incomplete. I do not worry about this - it should 
encourage you to read as widely as possible in order to neutralise my prejudices and 
biases. 

Good Luck! 



Venice and Cambridge, 

July 2007 Malcolm Longair 
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1 A Brief History of Cosmology 
and Galaxy Formation 



We begin with a broad-brush historical overview of the development of ideas and 
concepts concerning galaxies, cosmology and galaxy formation from the time of 
Tycho Brahe and Newton to the present day. This chapter therefore summarises 
qualitatively many of the key topics to be dealt with in quantitative detail in the rest 
of this book. 1 If you do not need this gentle introduction, or misguidedly think that 
history is boring, you may pass straightaway to Chap. 2. 



1.1 Pre-History 

It always comes as a surprise to me to realise how recent our understanding of 
galaxies, cosmology and galaxy formation really is. The motions of the Sun, Moon 
and planets against the background of the ‘fixed stars’ had been studied from ancient 
times, but the scientific study of their motions in the modern sense only began in the 
sixteenth century. 

The developments which led to the Newtonian revolution can be traced to the 
technological and observational achievements of Tycho Brahe in the final decades 
of the sixteenth century. I have told this remarkable story elsewhere (Longair, 2003). 
Tycho Brahe measured the positions of the Sun, Moon, planets and 777 stars over 
a period of 20 years, resulting in an order of magnitude improvement in the accuracy 
with which their orbits were determined over all previous measurements. In the year 
before his death, he employed Johannes Kepler as his assistant and assigned him 
the task of working out the orbits of the planets from his magnificent data sets. In 
the period 1601 to 1619, Kepler succeeded in interpreting the mass of Tycho’s data 
in terms of elliptical planetary orbits about the Sun, which was located in one of 
the foci of each ellipse. Kepler’s discovery of his three laws of planetary motion 
was a miracle of geometrical analysis. The three laws embody not only the elliptical 
orbits of the planets (the first law), but also the areal law - that equal areas are 
swept out by the radius vector from the Sun to the planet in equal times (the second 
law) - and the dependence of the period T of the planet’s orbit about the Sun upon 
the three-halves power of its mean distance r from the Sun, T oc C' 2 , (the third 
law). 

1 Comprehensive references to the original papers discussed in this chapter can be found in 

my book The Cosmic Century: A History of Astrophysics and Cosmology (Longair, 2006). 
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In 1664, at the age of only 21, Isaac Newton first derived his law of gravity from 
Kepler’s third law. Writing Newton ’s law of gravity in modern vector notation. 



M\M 2 . 

I = — G — 1, , 

r - 



(1.1) 



where f is the gravitational force acting between two point masses M\ and M 2 
separated by a distance r and i, is the unit vector in the direction from one mass 
to the other. Newton’s achievement was to unify the terrestrial law of gravity with 
celestial dynamics by showing that (1.1) could explain the acceleration of falling 
apples on Earth and the orbits of the planets. It is no exaggeration to say that 
astronomy, astrophysics and cosmology are the sciences of gravity - all the systems 
we study in astronomy and cosmology are attempting to counteract the omnipresent 
attractive force of gravity by one means or another. 

In 1692 Richard Bentley gave the first series of Boyle Lectures which Robert 
Boyle had founded ‘to combat atheism’ . Bentley took as his theme Newton’s ‘sublime 
discoveries’ and entered into a short but profound correspondence with Newton 
about the nature of our physical Universe. The question at issue was the stability of 
a finite or infinite Universe filled with stars under the attractive force of gravity. The 
conclusion of the correspondence was that the Universe must be infinite because, if 
it were not, it would collapse to its centre under gravity. With remarkable insight, 
they recognised, however, that an infinite Universe filled with stars is gravitationally 
unstable. If a star is displaced from its equilibrium position, it continues to accelerate 
in that direction. To quote Harrison: 

(Newton) agreed with Bentley that providence had designed a universe of 

infinite extent in which uniformly distributed stars stand poised in unstable 

equilibrium like needles on their points (Harrison, 1987). 

It was only in the twentieth century that the nature of this instability was fully 
appreciated. For a static medium, the instability criterion and the growth rate of the 
instability were derived by James Jeans in 1902 and the corresponding results for an 
expanding medium by Georges Lemaitre, Howard Robertson and Evgenii Lifshitz in 
the 1930s and 1940s. Their results are central to the understanding of the problems 
of the formation of structure in the Universe and the modern working out of their 
basic insights will dominate much of the discussion throughout this book. 

As part of the dialogue with Bentley, Newton proposed that the stars are objects 
like the Sun and he made star counts in an attempt to show that the stars are 
indeed uniformly distributed in space. From the seventeenth century onwards, most 
astronomers assumed that the stars are objects similar to the Sun, but at vastly 
greater distances. The problem was to find means of measuring their distances. If 
they were assumed to have the same intrinsic luminosities as the Sun, the inverse 
square law could be used to estimate distances by comparing the relative brightnesses 
of the Sun and the distant stars. The technical problem was that the Sun is so much 
brighter than the brightest stars that is was difficult to make good estimates of the 
ratio of their observed flux densities, or apparent magnitudes. An ingenious solution 
was discovered in 1668 by James Gregory, who used Jupiter as an intermediate 
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luminosity calibrator, assuming that its light was entirely sunlight reflected from the 
disc of the planet and that its surface was a perfect reflector. Then, the apparent 
magnitudes of Jupiter and the bright star Sirius could be compared and the distance 
of Sirius was found to be about 83,190 astronomical units (Gregory, 1668). The 
same method was used by John Michell in 1767 using Saturn as an intermediary to 
estimate of a distance of 460,000 astronomical units for Vega, or a Lyrae. 

The method of Gregory and Michell depended upon the assumption that all 
the stars have the same absolute luminosities. The first direct measurements of 
stellar distances were made by the technique of stellar parallaxes, the first successful 
measurement being announced in 1838 by Friedrich Bessel for the star 61 Cygni. The 
measurement of stellar parallaxes was however difficult and demanding technically 
and by 1900 only about 100 parallaxes were known for stars in the vicinity of the 
Sun. 



1.2 The Galaxies and the Structure of our Galaxy 

In his extraordinary text of 1610, the Side reus Nuncius or The Sidereal Messenger, 
Galileo Galilei demonstrated that the Milky Way can be resolved into stars when 
observed through the telescope. These observations led to the earliest speculative 
cosmologies of the modern era. The ‘island universe’ model of Rene Descartes, 
published in The World of 1636, involved an interlocking jig-saw puzzle of solar 
systems. In 1750, Thomas Wright of Durham published An Original Theory or New 
Hypothesis of the Universe, in which the Sun was one of many stars which orbit 
the ‘Divine Centre’ of the star system. Immanuel Kant in 1755 and Johann Lambert 
in 1761 took these ideas further and developed the first hierarchical, or fractal, 
models of the Universe. Kant also made the prescient suggestion that the flattening 
of these ‘island universes’ was due to their rotation. The problem with these early 
cosmologies was that they lacked observational validation. 

Towards the end of the eighteenth century, William Herschel was one of the first 
astronomers to attempt to define the distribution of stars in the Universe in some 
detail on the basis of careful astronomical observation. To determine the structure 
of the Milky Way, he counted the numbers of stars in different directions. Then, 
assuming that they all have the same intrinsic luminosities, he derived his famous 
picture for the structure of our Galaxy which consisting of a flattened disc of stars 
with diameter about five times its thickness, the Sun being located close to its centre 
(Fig. 1.1) (Herschel, 1785). 

John Michell had already warned Herschel that the assumption that the stars have 
a fixed luminosity was a poor approximation. In his remarkable pioneering paper of 
1767, Michell introduced statistical methods into astronomy in order to show that 
binary and star clusters must be real physical systems and not random associations 
of stars on the sky (Michell, 1767). Consequently, there must be a dispersion in the 
absolute luminosities of the stars from their observed range of apparent magnitudes in 
bright star clusters, such as the Pleiades. Despite this warning, Herschel proceeded to 
produce a number of different versions of his model for the structure of our Galaxy, 
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Fig. 1.1 . William Herschel’s model of the Galaxy based upon star counts in different directions. 
The Sun is located close to the centre of the disc of stars (Herschel, 1785) 



adding appendages to account for various features of the star counts in different 
directions. 

In 1802, Herschel measured the magnitudes of visual binary stars and was forced 
to agree with Michell’s conclusion about the wide dispersion in the luminosities of 
the stars (Herschel, 1802). Equally troubling was the fact that observations with his 
magnificent 40-foot telescope showed that, the fainter he looked, the more stars he 
continued to find. There seemed to be no edge to the Galaxy and Herschel gradually 
lost faith in his model. In addition, the importance of interstellar extinction by dust 
was not appreciated - it was only in the 1930s that its central importance for studies 
of our own and other galaxies was fully appreciated. 

Even before the discovery of the telescope, it had been realised that there exist 
‘nebulous’ objects which differ from the stars in having a diffuse or fuzzy appearance. 
Kant, Lambert, Swedenborg and Wright argued that these objects were ‘island 
universes’ similar to the Milky Way, but too distant to be resolved into stars. There 
was, however, no observational basis for this hypothesis. Herschel also inferred that 
the nebulae were island universes similar to our Galaxy. A test of this picture was 
to show that the nebulae could be resolved into stars and he believed that this had 
been achieved in a number of cases. In others, he assumed that the nebulae were 
too distant to be resolved into individual stars. This picture came into question, 
however, when he discovered that, among the nebulae were the planetary nebulae, 
which consist of a central star surrounded by a shell of gas. Herschel recognised 
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that these nebulae were unlikely to be resolved into stars but rather consisted of 
'luminous fluid’ surrounding the central star. 

The cataloguing of the bright nebulae was begun by Charles Messier whose 
catalogue of 109 objects was compiled during the years 1771 to 1784. Messier’s 
interest was primarily in comets and his objective in compiling the catalogue was 
to enable him to distinguish between diffuse nebulae and comets. The catalogue 
contains a mixture of what we now know are the brightest Galactic and extra- 
galactic nebulae and they are still commonly referred to by their Messier, or M, 
numbers. 

The systematic cataloguing of the nebulae was begun by William Herschel 
and his sister Caroline and was continued through the first half of the nineteenth 
century by his son John Herschel. The results of these huge endeavours was the 
publication by John Herschel in 1864 of the General Catalogue of Nebulae and 
Clusters of Stars containing 5079 objects. These catalogues were based upon visual 
observations long before photography became a standard tool of the astronomer. In 
1888, John Dreyer published an expanded catalogue which was known as the New 
General Catalogue of Nebulae and Clusters of Stars which, together with the two 
supplementary Index Catalogues of 1895 and 1908, contain some 15,000 objects. 
Objects in these catalogues are still commonly referred to by their NGC or IC 
numbers. 

While the cataloguing of the nebulae proceeded apace, their nature remained 
a mystery. Undoubtedly, some of them were gas clouds, as demonstrated by William 
Huggins’ pioneering spectroscopic observations of diffuse nebulae in the 1860s 
(Huggins and Miller, 1864). The big question was whether or not the ‘spiral nebulae’ 
were objects within our own Galaxy or were more distant systems. These nebulae 
were beyond the distances at which conventional techniques of distance measurement 
could be used. This problem culminated in what became known as ‘The Great 
Debate’ and concerned two related issues. Firstly, what is the size of our own Galaxy 
and, secondly, are the spiral nebulae members of our Galaxy or are they separate 
‘island universes’, well beyond the confines of our Galaxy? This key episode in 
the history of modern astronomy should be required reading for all observers and 
theorists (Sandage, 1961b; Hoskin, 1976; Smith, 1982; Trimble, 1995). 

To illustrate the nature of the problem, by 1920, Jacobus Kapteyn had determined 
the luminosity function of stars near the Sun and so, from star counts in different 
directions, determined the structure of the Galaxy which he found to be highly 
flattened with dimensions 1500 pc perpendicular to the plane and about 8 times that 
size in the Galactic plane (Fig. 1.2) (Kapteyn, 1922). 

Meanwhile, Harlow Shapley had adopted a quite different approach to the de- 
termination of Galactic structure. In 1912, Henrietta Leavitt had discovered the 
remarkable period-luminosity relation for Cepheid variable stars in the Magellanic 
Clouds (Fig. 1.3). This discovery provided a powerful means of measuring astro- 
nomical distances because the Cepheid variables are intrinsically luminous stars 
and their distinctive light curves can be recognised in stars in distant systems. The 
Cepheid variables were the tools used by Harlow Shapley to determine the structure 
of the Galaxy through his studies of globular clusters. He found the scale of the 
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Fig. 1.2. Kapteyn’s model for the distribution of stars in the Galaxy (Kapteyn, 1922). The 
diagram shows the distribution of stars in a plane perpendicular to the Galactic plane. The 
curves are lines of constant number density of stars and are in equal logarithmic steps. The 
Sun S is slightly displaced from the centre of the system 
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Fig. 1.3. A plot of the period- 
luminosity relation for the 25 
Cepheid variables discovered by 
Leavitt in the Small Magellanic 
Cloud (Leavitt, 1912). The upper 
locus is found for the maximum 
light of the Cepheid variables and 
the lower line for their minimum 
brightnesses 



globular cluster system to be enormous, the most distance globular cluster having 
a distance of 67 kpc. Furthermore, the globular cluster system was not centred upon 
the Solar System, but rather most of the globular clusters were found in a direction 
centred upon the constellation of Sagittarius (Fig. 1.4) (Shapley, 1918). 

The course of the debate was complex, but the issues were resolved finally and 
conclusively in 1925 by Edwin Hubble’s observations of Cepheid variables in the 
Andromeda Nebula. Using the period-luminosity relation for Cepheid variables, he 
established to everyone’s satisfaction that the spiral nebulae are distant extragalactic 
systems. 

Within a year, Hubble had published the first major survey of the properties 
of galaxies as extragalactic systems. In his remarkable paper (Hubble, 1926), he 
introduced an early version of his classification of galaxies into ellipticals, spirals 
and irregulars, estimated mass-to-light ratios for these different types of galaxies, 
used number counts of galaxies to show that they are uniformly distributed in space 
and hence estimated the mean density of matter in the Universe in the form of 
galaxies. Adopting Einstein’s static model of the Universe, he found that the radius 
of curvature of its spherical geometry was 27,000 Mpc. He estimated that, with the 
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Fig. 1.4. The distribution of globular clusters in the Galaxy according to Shapley’s distance 
measurements (Shapley, 1918). The scales on the abscissa and ordinate are in units of 100 pc 
and correspond to distances in and perpendicular to the Galactic plane respectively. The Sun, 
located at zero coordinates on the abscissa and ordinate, lies towards one edge of the globular 
cluster system 




100-inch Hooker telescope, he could observe typical galaxies to about 1/600 of the 
radius of the Universe. He concluded with the remark that 

. . . with reasonable increases in the speed of plates and sizes of telescopes, 
it may become possible to observe an appreciable fraction of the Einstein 
universe. 

This paper marked the beginning of extragalactic astronomy. It comes as no surprise 
to learn that George Ellery Hale began his campaign to raise funds for the Palomar 
200-inch telescope in 1928 - before the year was out, he had secured a grant of 
$6 million from the Rockefeller Foundation for the telescope, the construction of 
which was completed in 1949. 

In 1929, Hubble made his second fundamental contribution to cosmology. He 
showed that the extragalactic nebulae are all moving away from our own Galaxy and 
that their recessional velocities v are proportional to their distances r from our Galaxy 
(Fig. 1.5a) (Hubble, 1929). It is remarkable that he was able to deduce this key result 
from such a small sample of nearby galaxies but, within five years, he and Humason 
had extended the relation to very much greater velocities and distances using the 
apparent magnitudes of the fifth brightest members of clusters of galaxies as distance 
indicators (Fig. 1 ,5b). The velocity-distance relation v = Hor is commonly referred 
to as Hubble’s law and Hq as Hubble’s constant. The significance of this discovery 
was that, combined with the isotropy of the Universe, Hubble’s law demonstrates 
that the whole system of galaxies is partaking in a uniform expansion. 
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Fig. 1.5. a Hubble’s first velocity-distance relation for nearby galaxies (Hubble, 1929). The 
filled circles and the full line represent a solution for the solar motion using the nebulae 
individually; the open circles and the dashed line represent a solution combining the nebulae 
into groups. The cross is an estimate of the mean distance of the other 20 galaxies for 
which radial velocities were available, b The velocity-apparent magnitude relation for the 
fifth brightest member of clusters of galaxies, corrected for galactic obscuration (Hubble 
and Humason, 1934). Each cluster velocity is the mean of the various individual velocities 
observed in the cluster, the number being indicated by the figure in brackets. 



1.3 The Theory of the Expanding Universe 

In Newton’s Principia Mathematica, he emphatically took the position that all motion 
takes place with respect to a system of absolute space and time. He fiercely rejected 
the idea that the motion of a body could only be described relative to those of 
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other bodies. This position was challenged by Bishop Berkeley, Christiaan Huygens 
and others but, at least until the late nineteenth century, Newton’s view prevailed. 
The issue was revived by Ernst Mach who argued that motion can only be defined 
relative to other bodies. Specifically, he took the view that the local inertial frame 
of reference is determined by the frame of the distant stars, or galaxies in modern 
parlance. Thus, a freely swinging Foucault pendulum swings in a reference frame 
which is fixed relative to the distant galaxies. Albert Einstein gave the name Mach’s 
principle to this idea. 

During the late eighteenth century, non-Euclidean geometries began to be taken 
seriously by mathematicians who realised that the fifth postulate of Euclid, that 
parallel lines meet only at infinity, might not be essential for the construction of 
a self-consistent geometry. Proposals that the global geometry of space might not 
be Euclidean were discussed by Girolamo Saccheri and Johann Lambert. In 1816, 
Carl Friedrich Gauss repeated this proposal in a letter to Christian Gerling and was 
aware of the fact that a test of the local geometry of space could be carried out by 
measuring the sum of the angles of a triangle between three high peaks, the Brocken, 
Hoherhagen and Inselberg. In 1818, Gauss was asked to carry out a geodetic survey 
of the state of Hanover and he devoted a large effort to carrying out and reducing 
the data himself. He was certainly aware of the fact that the sum of the angles of the 
triangle was 180° within the limits of geodetic measurements. 

The fathers of non-Euclidean geometry were Nikolai Lobachevsky, who became 
rector of Kazan University in Russia in 1827, and Janos Bolyai in Transylvania, 
then part of Hungary. In the 1820s, they independently solved the problem of the 
existence of non-Euclidean geometries and showed that Euclid’s fifth postulate could 
not be deduced from the other postulates (Lobachevsky, 1829, 1830; Bolyai, 1832). 
In his papers entitled On the Principles of Geometry, Lobachevsky also proposed 
an astronomical test of the geometry of space. If the geometry were hyperbolic, the 
minimum parallax of any object would be 

6 = arctan ( — \ , (1-2) 

V -R ' 

where a is the radius of the Earth’s orbit and .R the radius of curvature of the 
geometry. He found a minimum value of IR > 1.66 x 10 5 AU = 2.6 light years, 
using an observational upper limit of 1 arcsec for the parallax of bright stars. In 
a prescient statement which will warm the hearts of observational astronomers, he 
remarked: 

There is no means other than astronomical observations for judging the 

exactness which attaches to the calculations of ordinary geometry. 

Non-Euclidean geometries were placed on a firm theoretical basis by Bernhard Rie- 
mann, who also discovered closed spherical geometries. The English-speaking world 
was introduced these ideas through the works of William Clifford and Arthur Cayley. 
Until Albert Einstein’s discovery of the General Theory of Relativity, considerations 
of the geometry of space and the role of gravity in defining the large-scale structure 
of the Universe were separate questions. After 1915, they were inextricably linked. 
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In that year, after a titanic intellectual struggle, Einstein discovered the definitive 
version of his General Theory of Relativity which describes how space-time is 
distorted by the presence of matter and how, in turn, matter moves along trajec- 
tories in bent space-time (Einstein, 1915, 1916). For the first time, a relativistic 
theory of gravity was available which enabled self-consistent models of the Uni- 
verse as a whole to be constructed and, characteristically, Einstein did not hesitate 
to do so. 

In seeking a solution of his field equations for the Universe as a whole, Einstein 
had explicitly in mind that Mach’s principle should be incorporated into any model 
of the large-scale structure of the Universe. He had, however, a major problem. 
Without modification, the held equations predicted that the Universe was unstable. 
He could only find static solutions by introducing what is now known as the cosmical 
or cosmological constant X, which appears as a constant in Einstein’s held equations. 
In his great paper of 1917, Einstein showed that the introduction of the cosmological 
constant resulted in static solutions for the Universe as a whole which had closed, 
spherical geometry and a hnite size (Einstein, 1917). He also believed that he had 
incorporated Mach’s principle into General Relativity, in the sense that no solution 
of the equations would exist if there were no matter present. In the same year, 
this was, however, shown to be incorrect by Willem de Sitter, who found solutions 
of the equations even if there were no matter present in the Universe (de Sitter, 
1917). 

For many decades, the status of the cosmological constant was the subject of 
debate. In 1919, Einstein realised that a term involving the cosmological constant 
would appear in the held equations of General Relativity, quite independent of its 
cosmological significance (Einstein, 1919). In the derivation of the held equations, 
the A-term appears as a constant of integration which is normally set equal to zero in 
the development of standard General Relativity. Einstein was not enthusiastic about 
the term, remarking that it ‘detracts from the formal beauty of the theory’ . Willem 
de Sitter wrote in 1919 that the term 

. . . detracts from the symmetry and elegance of Einstein’s original the- 
ory, one of whose chief attractions was that it explained so much without 
introducing any new hypotheses or empirical constant. 

Others regarded it as a constant which appears in the development of the General 
Relativity and its value should be determined by astronomical observation. 

The irony of the situation is that this debate took place before it was realised 
that the Universe is in fact non- stationary. In 1922, Aleksander Friedman published 
the first of two classic papers in which he discovered both static and expanding 
solutions of Einstein’s field equations. In the first paper, Friedman found solutions for 
expanding universes with closed spatial geometries, including those which expand 
to a maximum radius and eventually collapse to a singularity (Friedman, 1922). In 
the second paper of 1924, he showed that there exist expanding solutions which are 
unbounded and which have hyperbolic geometry (Friedman, 1924). These solutions 
correspond exactly to the standard world models of general relativity and are known 
as the Friedman world models. 
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In 1925, Friedman died of typhoid in Leningrad before the fundamental sig- 
nificance of his work was appreciated. The neglect of Friedman’s work in these 
early days is somewhat surprising since Einstein had commented, incorrectly as he 
admitted, on the first of the two papers in 1923. It was not until Georges Lemaitre 
independently rediscovered the same solutions in 1927, and then became aware 
of Friedman’s papers, that the pioneering nature of Friedman’s contributions was 
appreciated (Lemaitre, 1927). 

Einstein’s field equations without the cosmological constant contain perfectly 
satisfactory solutions in which the Universe is uniformly expanding. According 
to George Gamow, when the expansion of the Universe was discovered, Einstein 
regarded the introduction of the cosmological constant as ‘the biggest blunder of my 
life’ (Gamow, 1970). The cosmological constant was not consigned to oblivion for 
long however. As Yakov Zeldovich remarked: 

The genie is out of the bottle and, once he is out, he is very difficult to put 

back in again. 

The cosmological constant immediately found a role in reconciling the age of the 
Earth with the expansion age of the Universe as given by the inverse of Hubble’s 
constant Hq 1 . If the cosmological constant is zero, all Friedman models of the Uni- 
verse have ages less than /7 (| 1 . At that time, Hubble’s estimate of Hq was about 
500 km s 1 Mpc 1 , corresponding to // (| ! = 2 x 10 9 years. This time-scale was less 
than the age of the Earth as determined by nucleocosmochronology, that is, from 
dating using long-lived radioactive isotopes. A positive value of the cosmological 
constant can resolve this discrepancy since its effect is to stretch out the expansion 
time-scale of the Universe, a picture advocated by Arthur Eddington and Lemaitre. 
It turned out that Hubble’s estimate of Hq was seriously overestimated and, follow- 
ing revisions in the 1950s by Walter Baade and Allan Sandage, this conflict was 
eliminated. Despite the fact that the cosmological constant appeared to be no longer 
necessary, it made regular appearances in the literature to account for various fea- 
tures of cosmological data, but these pieces of evidence were not compelling. Then, 
during the period 1995 to 2005, convincing evidence for a positive value of the cos- 
mological constant was found from studies of very distant Type la supernovae and 
from determinations of the power spectrum of fluctuations in the Cosmic Microwave 
Background Radiation. 

As the standard models of General Relativity became better understood, a major 
thrust of cosmological research became the determination of the large-scale dynami- 
cal and geometrical properties of the Universe - its rate of expansion, its deceleration, 
its mean density, its geometry and its age. These remained among the most diffi- 
cult programmes of modern observational cosmology until, in the first years of the 
twenty-first century, precise estimates became available using techniques undreamt 
of by the pioneers of geometrical cosmology. 
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1.4 The Big Bang 

The next major advance occurred soon after the Second World War when George 
Gamow realised that, in an expanding Universe, the early stages must have been 
very hot indeed - the temperature was so high that the dynamics of the expansion 
were dominated by the energy density of thermal radiation rather than by its matter 
content, in other words, the Universe was radiation-dominated. Following an earlier 
suggestion of Lemaitre, he attempted to explain the origin of the chemical elements 
by primordial nucleosynthesis, that is, by nuclear fusion processes as the Universe 
cooled down from its very hot initial stages. The reasons for adopting this picture 
were twofold. Firstly, following the work of Cecilia Payne, the abundances of the 
chemical elements in stars seemed to be remarkably uniform and secondly it was 
thought that the central temperatures of the stars were not high enough for nucle- 
osynthesis to take place. Gamow’s programme was not successful because of the 
problem of synthesising elements heavier than helium - there are no stable isotopes 
with atomic mass numbers 5 and 8. Therefore, in the short time-scales available 
in the hot early phases of the expansion, there was not time to synthesis elements 
heavier that helium. Gamow’s coworkers Ralph Alpher and Robert Herman showed 
that only deuterium, helium-3 and helium-4 were created in significant quantities 
(Fig. 1.6) (Alpher and Herman, 1950). 

In the course of their calculations, Alpher and Herman worked out the thermal 
history of the Universe in some detail and predicted that there should be present in 
the Universe today a diffuse background of black-body radiation with temperature 
about 5 K, the cooled remnant of its very hot early phases (Alpher and Herman, 
1948). The detection of this background radiation was far beyond the capabilities 
of the technology of the 1940s and the lack of success of Gamow’s programme of 
primordial nucleosynthesis resulted in the neglect of this key prediction for many 
years. Furthermore, in the 1950s, Fred Hoyle discovered the triple-a resonance, 
which leads to the formation of carbon from three helium nuclei (Hoyle, 1954). Soon 
after, he and his colleagues, Margaret Burbidge, Geoffrey Burbidge and William 
Fowler, showed how the heavy elements could be accounted for by nucleosynthesis 
in stars (Burbidge et al., 1957). 

Interest in what is now referred to as the Big Bang model of the Universe grew 
steadily through the 1950s and early 1960s as evidence was found for cosmological 
evolutionary effects in the distribution of faint radio sources (Ryle, 1955, 1958). On 
the theoretical side, interest was rekindled in the question of the synthesis of elements 
in the early Universe, not now with a view to creating all the elements, but rather 
to account for the cosmic abundance of helium. By 1964, it was appreciated that, 
wherever helium could be observed in the Universe, it is present with a very high 
chemical abundance, about 24% by mass. This figure far exceeded what could be 
explained by stellar nucleosynthesis. I remember vividly attending a course of post- 
graduate lectures given by Fred Hoyle in Cambridge in 1964 entitled Problems of 
Extragalactic Astrophysics in which this problem was discussed. During the lecture 
course, Hoyle, Roger Tayler, and John Faulkner carried out detailed computations of 
the expected abundance of helium produced by primordial nucleosynthesis. Within 
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Fig. 1.6. The evolution of the 
fraction (by number) of the light 
nuclei in a radiation-dominated 
Universe, according to calcula- 
tions by Fermi and Turkevich and 
published by Alpher and Herman 
in 1950 (Alpher and Herman, 
1950). The models began with 
100% of the material in the form 
of neutrons. The tritium 'H and 
neutrons shown surviving to 
2000 seconds decay radioactively 
with half-lives of 12.46 years and 
10.25 minutes respectively 
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a week of the topic being raised, they had shown that about 23 to 25% of helium by 
mass is created by this process and that the percentage is remarkably independent 
of the precise initial conditions. The paper by Hoyle and Tayler was published in 
Nature in 1964 (Hoyle and Tayler, 1964). Subsequent more detailed calculations by 
Robert Wagoner, Fowler and Hoyle confirmed these conclusions and suggested that 
other elements which are difficult to account for by stellar nucleosynthesis, the light 
isotope of helium, 3 He, deuterium, D, and lithium, 7 Li, could also be accounted for in 
this way (Wagoner et al., 1 967). Equally important, the success of these computations 
resulted in an upper limit to the mean baryon mass density of the Universe of about 
one tenth the critical density - if the density were any higher, less than the observed 
abundances of deuterium and helium-3 would be created primordially. 

By the early 1960s, as the sensitivity of receivers for centimetre wavelengths 
improved, it became feasible to search for the cool background radiation left over 
from the early stages of the Big Bang. The predicted remnant of the Big Bang was 
discovered, more or less by accident, by Arno Penzias and Robert Wilson in 1965 
(Penzias and Wilson, 1965). The Cosmic Microwave Background Radiation was the 
second key discovery of twentieth century observational cosmology. Observations 
by the Cosmic Background Explorer (COBE), launched in 1989, showed that, away 



16 



1 A Brief History of Cosmology and Galaxy Formation 



from the Galactic plane, the radiation is uniform over the sky to better than one part 
in 100,000 on angular scales greater than 7° and that its spectrum is of black-body 
form with a quite remarkable precision (Smoot et ah, 1992; Fixsen et al., 1996). 
These observations provided compelling evidence that our Universe went through 
a very hot, dense phase when the matter and radiation were in thermal equilibrium 
in its early stages. 

The upshot of these discoveries was that there were four independent pieces of 
evidence for the Big Bang picture of the origin and evolution of our Universe. Firstly, 
the expansion of the distribution of galaxies discovered by Hubble; secondly, the 
black-body spectrum and isotropy of the Cosmic Microwave Background Radia- 
tion; thirdly, the formation of the light elements by primordial nucleosynthesis; and 
fourthly, the fact that the ages of the oldest stars and nucleochronology ages were of 
the same order as the expansion age of the Universe. Thus, the Big Bang provided 
a natural framework within which to tackle the problems of galaxy and structure 
formation. 



1.5 Galaxy and Structure Formation 

The Friedman world models are isotropic and homogeneous and so the enormous 
diversity of structure we observe in the Universe today is absent. The next step 
in developing more realistic models of the Universe is to include small density 
perturbations into the homogeneous, isotropic models and study their development 
under gravity. For the case of a stationary medium, this problem was solved by 
James Jeans in 1902 (Jeans, 1902). The criterion for collapse is that the size of the 
perturbation should exceed the Jeans’ length 7.j = q/(G’0o/jt) l/2 , where c s is the 
speed of sound in the medium and qq its density. On scales greater than the Jeans’ 
length, the instability grows exponentially. The physical meaning of the instability 
criterion is that, on large enough scales, the gravitational force of attraction by the 
matter of the perturbation exceeds the pressure gradients which resist collapse. 

The analysis was repeated for the case of an expanding medium in the 1930s by 
Lemaitre and by Richard Tolman for the case of spherically symmetric perturbations 
(Lemaitre, 1933; Tolman, 1934) and the solution for the general case was found 
by Evgenii Lifshitz in 1946 (Lifshitz, 1946). Lifshitz found that the condition for 
gravitational collapse is exactly the same as the Jeans’ criterion at any epoch but, 
crucially, the growth-rate of the density perturbations is no longer exponential but 
only algebraic. For a Universe with the critical density, J2 0 = 1 or go — 3H^/8itG, 
the density contrast A — Sq/q grows with time as Zl oc t 2 '\ The implication of this 
result is that the fluctuations from which the large-scale structure of the Universe 
formed cannot have grown from infinitesimal random perturbations. For this reason, 
Lemaitre, Tolman and Lifshitz inferred that galaxies could not have formed by 
gravitational collapse. 

From the early 1960s onwards, other authors took the point of view that the 
solution to the problem was to include finite perturbations into the model of the early 
Universe and then follow in detail how their mass spectrum would evolve with time. 
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The Moscow school led by Yakov Zeldovich, Igor Novikov and their colleagues and 
James Peebles at Princeton pioneered this approach to the study of the development 
of structure in the Universe. If perturbations on a particular physical scale are tracked 
backwards into the past, at some large redshift, the scale of the perturbation is equal 
to the horizon scale, that is r ct, where t is the age of the Universe. In 1964, 
Novikov showed that, to form structures on the scales of galaxies and clusters of 
galaxies, the density perturbations on the scale of the horizon had to have amplitude 
A = Sq/q ~ 10 4 in order to guarantee the formation of galaxies by the present 
epoch (Novikov, 1964). These were certainly not infinitesimal perturbations and 
their origin had to be ascribed to processes occurring in the very early Universe. 

The discovery of the Cosmic Microwave Background Radiation in 1965 had an 
immediate impact upon these studies since the thermal history of the pre-galactic 
gas could be worked out in detail and this was essential in order to determine 
how the speed of sound, and hence the Jeans’ length, varied with cosmic epoch. If 
there is no energy input into the background radiation, the temperature of the thermal 
background radiation changes with scale factor a as T — To/a = 7o(l+z), where z is 
redshift, exactly as in the adiabatic expansion of a photon gas. Therefore, at redshifts 
Z ~ 1500, the temperature of the radiation was about 4000 K, at which temperature 
there were sufficient photons in the Wien region of the Planck distribution to ionise 
all the intergalactic hydrogen. This epoch is referred to as the epoch of recombination 
and at earlier epochs the hydrogen was fully ionised; at a correspondingly earlier 
epoch, the primordial helium was ionised as well. Somewhat earlier than the epoch 
of recombination, the inertial mass density of the radiation was equal to the mass 
density of the matter, qc 2 — aT 4 , and so, at times earlier than the epoch of matter 
and radiation equality, the dynamics of the Universe were radiation-dominated. 

The coupling of matter and radiation by electron scattering was worked out by 
Ray Weymann in 1966 and in much more detail by Zeldovich and Rashid Sunyaev 
in 1969 (Weymann, 1966; Zeldovich and Sunyaev, 1969). The pioneering papers by 
Zeldovich and Sunyaev were based upon the theory of induced Compton scatter- 
ing which had been published by Aleksander Kompaneets in 1956, long after this 
remarkable classified work had been completed (Kompaneets, 1956). What these 
papers showed was that, during the radiation-dominated epochs, the matter and radi- 
ation were maintained in very close thermal contact by Compton scattering as long 
as the intergalactic gas remained ionised. This enabled the speed of sound to be 
determined at all epochs before the epoch of recombination. Therefore, the evolu- 
tion of the Jeans’ length and the mass of baryonic matter within this length, what is 
known as the Jeans’ mass, could be evaluated. 

In 1968, Joseph Silk showed that, during the pre-recombination epochs, sound 
waves in the radiation-dominated plasma were damped by repeated electron scat- 
terings (Silk, 1968). The effect of this damping was to dissipate fluctuations with 
masses less than about 10 I2 M Q , a mass known as the Silk mass, by the epoch of 
recombination. Consequently, all fine-scale structure would be wiped out and only 
large-scale structures on the scale of large galaxies and clusters of galaxies could 
form after recombination. In the early 1970s, Zeldovich and Edward Harrison inde- 
pendently put together information about the spectrum of the initial fluctuations on 
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different physical scales and showed that observed structures in the Universe could 
be accounted for if the mass fluctuation spectrum had the form A(M) oc M 2/3 in 
the very early Universe, corresponding to a power spectrum of initial fluctuations of 
the form \Ak\ 2 oc k" with n — 1. The amplitude of this scale-free power spectrum, 
known as the Harrison— Zeldovich spectrum of initial perturbations , was inferred to 
be ~ 1(T 4 (Harrison, 1970; Zeldovich, 1972). 

A key test of these models was provided by the fact that density fluctuations at 
the epoch of recombination should leave some imprint upon the intensity distribution 
of the Cosmic Microwave Background Radiation on the sky. In the simplest picture, 
if the process of recombination were instantaneous, adiabatic perturbations would 
be expected to result in temperature fluctuations AT/T = | Af/c 2 = \Aq/q on 
large physical scales associated with large-scale gravitational perturbations, an ef- 
fect known as the Sachs-Wolfe effect (Sachs and Wolfe, 1967). In fact, the problem 
is somewhat more complicated than this, partly because the process of recombi- 
nation is not instantaneous and because other physical processes come into play 
on angular scales of about 1° and less. These include the adiabatic compression 
of the perturbations and first-order Doppler scattering due to the collapse of the 
primordial perturbations. These predictions provided a challenge for the observers 
since the amplitudes of the temperature fluctuations in these early theories were in 
the range AT/T ~ 1 0 3 — 10 4 , well within the capability of sensitive anisotropy 
measurements of the Cosmic Microwave Background Radiation. 

In the 1970s, these concepts gave rise to two principal scenarios for the formation 
of structure in the Universe. The first, known as the adiabatic model, was based upon 
a picture in which the perturbations were adiabatic sound waves before the epoch 
of recombination and structure in the Universe formed by the fragmentation of 
large-scale structures which reached amplitude Sq/q ~ 1 at relatively late epochs. 
A realisation of this scenario was described by Andrei Doroshkevich, Sunyaev and 
Zeldovich in 1974 (Doroshkevich et al., 1974). 

An alternative picture was one in which the perturbations were not sound waves 
but isothermal perturbations in pressure balance with the background radiation in 
the pre-recombination plasma. Small mass perturbations were not damped in this 
picture and so perturbations on all scales survived to the recombination epoch. After 
that epoch, the Jeans’ mass dropped to about 10 6 M Q , corresponding roughly to 
the masses of globular clusters. Galaxies and clusters of galaxies then formed by 
the process of hierarchical clustering under the influence of perturbations on larger 
physical scales. 

Both models predicted similar amplitudes for the density perturbations at the 
epoch of recombination on large physical scales and consequently similar temper- 
ature perturbations in the Cosmic Microwave Background Radiation. Their subse- 
quent behaviour was, however, quite different. The adiabatic picture could be thought 
of as a ‘top-down’ process of galaxy formation in which the largest scale structures 
formed first and then smaller scale structures formed by a process of fragmentation. 
In contrast, the isothermal picture corresponded to a ‘bottom-up’ process in which 
small-scale objects came together to form larger structures by hierarchical cluster- 
ing. In the adiabatic picture, galaxies, stars and the chemical elements all formed at 
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relatively late epochs, whereas in the isothermal picture, they could begin to form at 
very much earlier cosmic epochs. 

Throughout the 1970s increasingly sensitive searches were made for temperature 
fluctuations in the Cosmic Microwave Background Radiation, these observations 
being analysed critically by Bruce Partridge in his review of 1980 (Partridge, 1980a). 
His own observations had reached sensitivities of AT/T ~ 1 0 4 or slightly better 
by that time (Partridge, 1980b). Models with low density parameters were in serious 
conflict with these upper limits because, in these, there is relatively little growth 
of the perturbations after the epoch of recombination. Thus, by the early 1980s, 
the upper limits to the intensity fluctuations in the Cosmic Microwave Background 
Radiation were beginning to constrain severely purely baryonic theories of structure 
formation. Furthermore, the limits to the density parameter in the form of baryons 
from primordial nucleosynthesis arguments showed that, if the density of matter in 
the Universe were close to the critical density, most of the matter in the Universe 
would have to be in some non-baryonic form. 



1.6 Hot and Cold Dark Matter 

A solution to these problems appeared in 1980 when Valentin Lyubimov and his 
collaborators reported experiments which suggested that the electron neutrino had 
a finite rest mass of about 30 eV (Lyubimov et al., 1980). In 1966, Semion Gershtein 
and Zeldovich had noted that relic neutrinos of finite rest mass could make an ap- 
preciable contribution to the mass density of the Universe (Gershtein and Zeldovich, 
1966) and, in the 1970s, Gyorgi Marx and Alex Szalay had considered the role of 
neutrinos of finite rest mass as candidates for the dark matter, as well as studying 
their role in galaxy formation (Marx and Szalay, 1972). The intriguing aspect of 
Lyubimov’s result was that, if the relic neutrinos had this rest mass, the Universe 
would just be closed, = 1 . 

Zeldovich and his colleagues developed a new version of the adiabatic model in 
which the Universe was dominated by neutrinos with finite rest mass (Doroshkevich 
et al., 1980). Neutrino fluctuations would begin to grow as soon as they became non- 
relativistic but, since the neutrinos are very weakly interacting particles, they would 
stream freely out of the perturbations and so small-scale density perturbations would 
be quickly damped out. The closely coupled matter and radiation density fluctuations 
would oscillate at a low level during the pre-recombination era but, after recombi- 
nation, the baryonic matter would fall into the larger amplitude neutrino fluctuations 
and then evolve more or less as in the standard adiabatic scenario. Because of the 
free-streaming of the neutrinos, however, only the very largest scale perturbations 
with masses M > 10 16 M Q would survive to the epoch of recombination and so, just 
as in the adiabatic model, the largest scale perturbations would form first and then 
smaller scale structures form by the process of fragmentation. This model had the 
great advantage of reducing very significantly the expected amplitude of temperature 
fluctuations in the Cosmic Microwave Background Radiation since the perturbations 
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in the baryonic matter would be of low amplitude during the critical phases when 
the background photons were last scattered. 

In 1970, Zeldovich discovered a solution for the non-linear development of 
a collapsing cloud and used it to show that large-scale perturbations would collapse 
into sheets and pancakes which resemble the large-scale filamentary structure seen 
in the distribution of galaxies (Zeldovich, 1970). This scenario for galaxy formation 
became known as the Hot Dark Matter picture since the neutrinos were relativistic 
when they decoupled from the primordial plasma. 

There were, however, concerns about this picture. First of all, there were reser- 
vations about the experiments which claimed to have measured the rest mass of the 
electron neutrino and it appears that Lyubimov’s results were erroneous - the present 
upper limit to the rest mass of the electron neutrino is a few electronvolts. Secondly, 
constraints could be set to the masses of the neutrinos if they were to constitute the 
dark matter in galaxies, groups and clusters of galaxies. In 1979, James Gunn and 
Scott Tremaine showed how the phase space constraints associated with fermions 
such as neutrinos could be used to set lower limits to their masses (Tremaine and 
Gunn, 1979). While 30 eV neutrinos could bind clusters and the haloes of giant 
galaxies, those needed to bind dwarf galaxies would have to have masses much 
greater than 30 eV. This was not necessarily a fatal flaw because it could be that 
some other form of dark matter was present in the haloes of the dwarf galaxies. 

There was also the realisation about this time that there were several alternative 
possibilities for the dark matter which came from theories of elementary particles. 
Examples included the axions, supersymmetric particles such as the gravitino or 
photino and ultraweakly interacting neutrino-like particles, all of which would be 
relics of the very early Universe. The period 1980 to 1982 marked the period when the 
particle physicists began to take the early Universe very seriously as a laboratory for 
particle physics. According to James Peebles, Richard Bond introduced the term Cold 
Dark Matter in 1982 to encompass many of the exotic types of particle suggested by 
particle physicists. The matter was ‘cold’ in the sense that these particles decoupled 
from the thermal background after they had become non-relativistic. 

The Cold Dark Matter scenario is similar in many ways to the isothermal model. 
Since the matter is very cold, perturbations are not destroyed by free streaming. 
Fluctuations on all scales can survive and so, when the pre-recombination Universe 
became matter dominated, these perturbations began to grow, decoupled from the 
matter and radiation. As in the Hot Dark Matter scenario, after the epoch of recom- 
bination, the baryonic matter collapsed into the growing potential wells in the dark 
matter and galaxies, groups and clusters formed by hierarchical clustering. In 1982, 
Peebles demonstrated how the presence of such particles could reduce the ampli- 
tude of the predicted fluctuations in the Cosmic Microwave Background Radiation 
to levels consistent with the observational upper limits (Peebles, 1982). A remark- 
ably useful formalism for the process of hierarchical clustering was described by 
William Press and Paul Schechter in 1974 which gives a good description of how the 
mass function of objects of different masses evolves with time (Press and Schechter, 
1974). 



1 .6 Hot and Cold Dark Matter 



21 



These alternative dark matter scenarios for galaxy formation were the subject 
of a great deal of analysis and computer simulation during the 1980s. The Hot 
Dark Matter picture tended to predict too much power in large-scale structures, 
while the Cold Dark Matter models predicted too little (Frenk, 1986). One of the 
most important predictions of these models was that temperature fluctuations in the 
Cosmic Microwave Background Radiation should be detected at the level of about 
one part in 10 5 and these were detected in 1992 by the COBE satellite. Fluctuations 
on angular scales 0 >7° were discovered with amplitude AI/I ~ 10~ 5 by George 
Smoot and his colleagues (Smoot et al., 1992). These fluctuations correspond to 
physical dimensions about ten times the size of the largest holes and voids observed 
in the distribution of galaxies. On these large angular scales, the source of intensity 
fluctuations is the Sachs-Wolfe effect, that is, the gravitational redshift associated 
with photons originating from within the density fluctuations at the last scattering 
surface (Sachs and Wolfe, 1967). It can be shown that the Harrison-Zeldovich 
spectrum with n = 1 results in temperature fluctuations which are independent of 
angular scale for all scales greater than a few degrees, consistent with the COBE 
observations. 

The Cold Dark Matter model became the preferred picture for galaxy and struc- 
ture formation, but it needed patching up to achieve consistency with all the obser- 
vations. Viable models were constructed which include a tilted power spectrum of 
the initial fluctuations as compared with the standard Harrison-Zeldovich spectrum, 
others included the cosmological constant or decaying neutrinos, yet others consid- 
ered a mixture of Hot and Cold Dark Matter and others considered that the Universe 
might be open. All the models included biasing of the distribution of visible matter 
relative to that of the dominant dark matter, which defined the large-scale structure 
of the distribution of galaxies (Turok, 1997). 

The picture changed dramatically in the final years of the twentieth century and 
the first few years of the twenty-first. Firstly, the Type la supernovae were found 
to be excellent ‘standard candles’ for the estimation of cosmological distances. 
The redshift-magnitude relation for these supemovae strongly suggested that the 
cosmological constant was not zero, but had a large positive value, the best-fitting 
value corresponding to a density parameter of what became known as the dark energy 
of ~ 0.7 (Knop et al., 2003; Tonry et al., 2003). 

Equally important were experiments to determine the detailed power spectrum 
of temperature fluctuations in the Cosmic Microwave Background Radiation which 
were predicted to display prominent maxima on angular scales less than 1°. The 
predictions were refined in the context of the baryonic adiabatic and isothermal 
models by Sunyaev and Zeldovich in 1970 (Sunyaev and Zeldovich, 1970). It was 
realised that these acoustic fluctuations contain a great deal of information about 
the large-scale properties of the Universe and a number of experimental groups 
made very large efforts to pin down the exact shape of their power spectrum. Strong 
evidence for these oscillations were found in these experiment, but these endeav- 
ours were largely superseded by the first-year results of the Wilkinson Microwave 
Anisotropy Probe (Bennett et al., 2003). These defined in exquisite detail the power 
spectrum of the fluctuations in the Cosmic Microwave Background Radiation and 
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enabled remarkably precise estimates of cosmological parameters to be made, par- 
ticularly when combined with the results of large-scale galaxy surveys such as the 
Anglo-Australian Telescope 2dF Survey and the Sloan Digital Sky Survey. These 
observations demonstrated beyond doubt that the large-scale geometry of the Uni- 
verse is very close to flat, that the density parameter in dark and baryonic matter 
is close to 0.28 and that the density parameter in the dark energy is close to 0.72 
(Tegmark et al., 2004). What is particularly impressive about these results is that 
they are entirely consistent with many independent astronomical estimates of the 
cosmological parameters. This has given rise to the concept of the concordance 
values of the cosmological parameters, a set of parameters which, within the quoted 
uncertainties, are in agreement with all the best estimates of their values. 

From the perspective of galaxy and structure formation, the good news was that 
these achievements incorporate naturally the formation of structure according to the 
standard /l COM model with no biasing. Equally impressive was the fact that the 
determination of the two-point correlation function for galaxies determined from 
the large-scale galaxy surveys now overlapped the corresponding angular scales 
in the Cosmic Microwave Background Radiation and that these were in excellent 
agreement. In the most recent analyses, evidence has been found for a maximum in 
the two-point correlation function for galaxies corresponding to the first peak in the 
power spectrum of perturbations in the Cosmic Microwave Background Radiation. 

The upshot of these remarkable developments is that we have now entered the 
era of precision cosmology in which cosmological parameters can be estimated 
with confidence to better than 5% and much deeper cosmological questions can be 
addressed by the present and future generations of observations and experiments. 



1.7 The Very Early Universe 

Despite the undoubted success of the concordance model, it raises as many problems 
as it solves. The picture is incomplete in the sense that, within the context of the 
standard world models, the initial conditions have to be put in by hand in order to 
create the Universe as we observe it today. How did these initial conditions arise? The 
resolution of these problems will undoubtedly give insight into the laws of physics 
under physical conditions which at the moment can only be studied by cosmological 
observations. 

- The horizon problem. This problem can be restated, ‘Why is the Universe so 
isotropic?’ (Dicke, 1961). At earlier cosmological epochs, the particle horizon 
r ~ ct encompassed less and less mass and so the scale over which particles 
could be causally connected became smaller and smaller. A vivid example of 
this problem is to work out how far light could have travelled along the last 
scattering layer at z — 1000 since the Big Bang. Regions of the sky separated by 
angular distances greater than 2° could not have been in causal communication. 
Why then is the Cosmic Microwave Background Radiation so isotropic? 
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The flatness problem. This problem arises from the fact that, according to the 
standard world models, if the Universe were set up with a value of the density 
parameter differing even slightly from the critical value £2q — 1 , it would diverge 
very rapidly from T?o = 1 at later epochs (Dicke, 1 96 l;Dicke and Peebles, 1979). 
There is nothing in the standard world models that would lead us to prefer any 
particular value of T?o- Why then in the density parameter close to one today? 
This is sometimes referred to as the fine-tuning problem. 

The baryon asymmetry problem. The baryon asymmetry problem arises from 
the fact that the photon-to-baryon ratio today is N y /Nb ~ 10 9 . If photons were 
neither created or destroyed, this ratio is conserved as the Universe expands. 
At temperature T ~ 10 10 K, electron-positron pair production takes place from 
the photon field. At a correspondingly higher temperature, baryon-antibaryon 
pair production takes place with the result that there must have been a very 
small asymmetry in the baryon-antibaryon ratio in the very early Universe if 
we are to end up with the correct photon-to-baryon ratio at the present day. 
If the Universe had been symmetric with respect to matter and antimatter, the 
photon-to-baryon ratio would now be about 10 18 , in gross contradiction with the 
observed value (Zeldovich, 1965). Therefore, there must be some mechanism 
in the early Universe which results in a slight asymmetry between matter and 
antimatter. 

The primordial fluctuation problem. What was the origin of the density fluctu- 
ations from which galaxies and large-scale structures formed? The amplitudes 
of the density perturbations when they came through the horizon had to be of 
finite amplitude, A = 8q/q ~ 10 4 , on a very wide range of mass scales. These 
cannot have originated as statistical fluctuations in the numbers of particles on, 
say, the scales of superclusters of galaxies. There must have been some physical 
mechanism which generated finite amplitude perturbations with power spectrum 
close to P(k) oc k in the early Universe. 

The values of the cosmological parameters. The horizon and flatness problems 
were recognised before compelling evidence was found for the finite value of 
the cosmological constant, or in modern parlance, the density parameter of the 
dark energy Q a- The Universe seems to be geometrically flat and so the sum 
of the density parameters in the matter and the dark energy must sum to unity, 
T2a + — 0.72 + 0.28 = 1 . Even if the sum of these two parameters were not 

precisely unity, it is a surprise that the two parameters are of the same order of 
magnitude at the present epoch because the matter density evolves with redshift 
as (1 + z) 3 , while the dark energy density parameter is unchanging with cosmic 
epoch. Why then do we live at an epoch when they have more or less the same 
value? 

A further problem concerns the present value of the density parameter of the 
dark energy T2a which can be estimated using simple concepts from quantum 
field theory. The value found is about 10 12 ° times greater than permissible values 
at the present epoch. This is quite a problem, but it should not be passed over 
lightly. If the inflationary picture of the very early Universe is taken seriously, 
this is exactly the type of force which drove the inflationary expansion. 



24 



1 A Brief History of Cosmology and Galaxy Formation 



- The nature of dark matter and dark energy. As if these problems were not serious 
enough, they are compounded by the fact that the nature of the dark matter and 
the dark energy are unknown. One of the consequences of precision cosmology 
is the troubling result that we do not understand the nature of about 95% of the 
material which drives the large-scale dynamics of the Universe. 

The first suggestion that some of these problems might be resolved by appeal 
to particle physics was made by Sakharov in 1967 who suggested that the baryon- 
antibaryon asymmetry might be associated with the type of symmetry-breaking 
observed in the decays of the K mesons, in other words, that the asymmetry is 
associated with the type of symmetry-breaking which occurs in Grand Unified 
Theories of elementary particles in the early Universe (Sakharov, 1967). 

The most important conceptual development for contemporary cosmology came 
in 1981 with Alan Guth’s proposal of the inflationary model for the very early Uni- 
verse (Guth, 1981). There had been earlier suggestions foreshadowing his proposal. 
For example, Zeldovich had noted in 1968 that there is a physical interpretation of 
the cosmological constant A associated with the zero-point fluctuations of a vac- 
uum. Andrei Linde in 1974 and Sydney Bludman and Malvin Ruderman in 1977 had 
shown that the scalar Higgs fields, which had been introduced to give the W ± and 
Z° particles mass, have similar properties to those which would result in a positive 
cosmological constant. 

In Guth’s paper of 1981, he realised that if the Universe went through an early 
exponential expansion phase, this would solve both the problem of the isotropy of 
the Universe on a large scale and would also drive the Universe towards a flat spatial 
geometry. The effects of the exponential expansion is to drive neighbouring particles 
apart at an exponentially increasing rate so that, although they were in causal contact 
in the very early Universe, the exponential inflation quickly moves them far beyond 
their local horizons and can account for the large-scale isotropy of the Universe by 
the end of the inflation epoch. The exponential expansion also straightens out the 
geometry of the Universe, however curved it may have been in its initial stages. At the 
end of this phase of exponential inflation, the Universe transforms into the standard 
Friedman world model, which, since it has very precisely flat geometry, must have 
T?o = 1. In Guth’s original picture, the transformation to the Friedman solution 
took place through a first-order phase transition but this created too many magnetic 
monopoles. The model was revised in 1982 by Linde and by Andreas Albrecht and 
Paul Steinhardt who showed how the transition to the Friedman solutions could be 
smooth and continuous and so avoid many of the problems associated with Guth’s 
proposal (Linde, 1982, 1983; Albrecht and Steinhardt, 1982). 

The original hope that a physical realisation for the inflationary expansion could 
be found within the context of particle physics beyond the standard model has not 
been achieved, but the underlying concepts of the inflationary picture have been 
used to define the necessary properties of the inflaton potential needed to create the 
Universe as we know it. 

Since 1982, the inflationary scenario for the early evolution of the Universe 
between the epochs when it was only 1 0 34 to 10 12 seconds old has been studied 
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very intensively. Among the further successes claimed for the theory has been 
the realisation that quantum fluctuations in the fields which drive the inflation are 
also amplified during the inflationary era. In 1977, Stephen Hawking and Gary 
Gibbons worked out the important result that quantum fluctuations in expanding 
de Sitter space produce thermal radiation with a well-defined temperature (Gibbons 
and Hawking, 1977). This acted as a stimulus to apply similar ideas to the new 
inflationary picture with a view to estimating the perturbation spectrum. Following 
the 1982 Nuffield Workshop held in Cambridge, the key result was established that 
the spectrum of quantum fluctuations of the vacuum Higgs fields were scale-free and 
result naturally in adiabatic curvature perturbations with spectrum strikingly similar 
to the Harrison-Zeldovich spectrum with n ~ 1 (Gibbons et ah, 1983). According 
to Andrew Liddle and David Lyth: 

Although introduced to resolve problems associated with the initial condi- 
tions needed for the Big Bang cosmology, inflation’s lasting prominence 
is owed to a property discovered soon after its introduction. It provides 
a possible explanation for the initial inhomogeneities in the Universe that 
are believed to have led to all the structures we see, from the earliest objects 
formed to the clustering of galaxies to the observed irregularities in the 
microwave background (Liddle and Lyth, 2000). 



2 The Large-Scale Structure of the Universe 



Our current picture of how matter and radiation are distributed in the Universe on 
a large scale is derived from a wide variety of different types of observation. In 
this chapter, we concentrate upon the large-scale distribution of matter and radiation 
in the Universe and discuss galaxies and clusters of galaxies in Chaps. 3 and 4 
respectively. The observations described in this chapter provide much of the essential 
underpinning of modern cosmological research. 



2.1 The Spectrum and Isotropy of the Cosmic Microwave 
Background Radiation 

On the very largest scales, the best evidence for the overall isotropy of the Universe 
is provided by observations of the Cosmic Microwave Background Radiation. This 
intense diffuse background radiation in the centimetre, millimetre and submillimetre 
wavebands was discovered in 1965 by Penzias and Wilson whilst commissioning 
a sensitive maser receiver system for centimetre wavelengths at the Bell Telephone 
Laboratories (Penzias and Wilson, 1965). It was soon established that this radiation is 
remarkably uniform over the sky and that, in the wavelength range 1 m > X > 1 cm, 
the intensity spectrum had the form I v <x v 2 , corresponding to the Rayleigh-Jeans 
region of a black-body spectrum at a radiation temperature of about 2.7 K. 

The maximum intensity of a black-body spectrum at a radiation temperature of 
2.7 K occurs at a wavelength of about 1 mm at which atmospheric emission makes 
precise absolute measurements of the background spectrum from the surface of the 
Earth very difficult indeed. During the 1970s and 1980s several high-altitude balloon 
experiments carrying millimetre and submillimetre spectrometers were flown and 
evidence found for the expected turn-over in the Wien region of the spectrum, 
but there were discrepancies between the experiments (Weiss, 1980). The only 
satisfactory approach for determining the detailed spectrum and isotropy of the 
Cosmic Background Radiation over the whole sky was to place the receiver system 
in a satellite above the Earth’s atmosphere and this was achieved by the Cosmic 
Background Explorer (COBE) of NASA which was launched in November 1989. 
This mission was dedicated to studies of the background radiation, not only in the 
millimetre and submillimetre wavebands, but also throughout the infrared waveband 
from 2 to 1000 p.m. 
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Fig. 2.1. The final spectrum of the Cosmic Microwave Background Radiation as measured by 
the COBE satellite (Fixsen et al., 1996). The units of the abscissa are inverse centimetres, so 
that 10 units corresponds to 1 mm and 5 to 2 mm. Uncertainties are a small fraction of the line 
thickness. Within the quoted errors, the spectrum is precisely that of a perfect black-body at 
radiation temperature T = 2.728 ± 0.002 K 



2.1.1 The Spectrum of the Cosmic Microwave Background Radiation 

The Far Infrared Absolute Spectrophotometer (FIRAS) of COBE measured the 
spectrum of the Cosmic Microwave Background Radiation in the wavelength range 
0.5 to 2.5 mm with very high precision during the first year of the mission. The 
FIRAS detectors and a reference black-body source were cooled to liquid helium 
temperatures and there was sufficient liquid cryogen for only one year of observation. 
Particular care was taken over the thermometry involved in making such absolute 
temperature measurements. The final spectrum shown in Fig. 2. 1 is that of a perfect 
black-body with a radiation temperature T — 2.728 ± 0.002 K (Fixsen et al., 
1996). More quantitatively, the deviations from a perfect black-body spectrum in the 
wavelength interval 2.5 > X > 0.5 mm amounted to less than 0.03% of the maximum 
intensity. This is the most beautiful example I know of a naturally occurring black- 
body radiation spectrum. 

There are two convenient ways of describing the degree to which the observed 
spectrum differs from that of a perfect black-body spectrum, both of them pioneered 
by Zeldovich and Sunyaev in the late 1960s (for details of their work, see their review 
of 1980 (Sunyaev and Zeldovich, 1980a)). They showed that the injection of large 
amounts of thermal energy in the form of hot gas into the intergalactic medium can 
produce various types of distortion of the black-body radiation spectrum because of 
Compton scattering of the background photons by hot electrons. We will not go into 
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the physics of these processes at this point, except to note the forms of distortion 
and the limits which can be set to certain characteristic parameters. 

If there were early injection of thermal energy prior to the epoch when the 
primordial plasma recombined at a redshift of about 1000, and if the number of 
photons was conserved, the spectrum would relax to an equilibrium Bose-Einstein 
spectrum with a finite dimensionless chemical potential //, 



The simplest way of understanding this result is to note that the Bose-Einstein 
distribution is the equilibrium distribution for photons when there is a mismatch 
between the total energy and the number of photons over which this energy is to 
be distributed. In the case of a black-body spectrum, both the energy density and 
number density of photons are determined solely by the temperature T t . In contrast, 
the Bose-Einstein distribution is determined by two parameters, the temperature T t 
and the dimensionless chemical potential /i. 

In the case of Compton scattering by hot electrons at late epochs, the energies of 
the photons are redistributed about their initial values and, to second order, there is 
an increase in their mean energies so that the spectrum is shifted to slightly greater 
frequencies. 1 In 1969, Zeldovich and Sunyaev showed that the distortion of the 
black-body spectrum takes the form 



where y is the Compton scattering optical depth y — f (kT t /m e c 2 ) ojN e &I, 
x = hv/kT r and oj is the Thomson scattering cross-section (Zeldovich and Sunyaev, 
1969). In the limit of small distortions, y 1, the intensity in the Rayleigh-Jeans 
region decreases as A I v /I v = —2 y and the total energy under the spectrum increases 
as e = eoe 4;y . 

Limits to the parameters y and /i have been derived from the very precise spectral 
measurements made by the FIRAS instrument. The results quoted by Page are as 
follows (Page, 1997): 



These are very strong limits indeed and will prove to be of astrophysical importance 
in the study of the physics of the intergalactic gas, as well as constraining the amount 
of star and metal formation which could have taken place in young galaxies. 

2.1.2 The Isotropy of the Cosmic Microwave Background Radiation 

Equally remarkable were the COBE observations of the isotropy of the distribution of 
the Cosmic Microwave Background Radiation over the sky. The prime instruments 

1 I have given a derivation of this result in my book High Energy Astrophysics, Vol. 1 

(Longair, 1997b). 




( 2 . 1 ) 




( 2 . 2 ) 



|y| < 1.5 x 10 5 , |/z| < 1CP 4 . 



(2.3) 
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for these studies were the Differential Microwave Radiometers which operated at 
frequencies of 31.5, 53 and 90 GHz, thus sampling the Rayleigh-Jeans region of the 
background spectrum. The angular resolution of the radiometers was 7°. The choice 
of observing frequency was crucial in these observations. At higher frequencies, 
the millimetre and submillimetre emission of diffuse Galactic dust at high galactic 
latitudes, often referred to as ‘cirrus’ , confuses the picture, whilst at lower frequencies 
the radio synchrotron radiation of ultrarelativistic electrons gyrating in the Galactic 
magnetic field becomes important. The final results of the four-year mission are 
shown in Fig. 2.2 at increasing levels of sensitivity (Bennett et al., 1996). 

Figure 2.2a illustrates the stunning result that the Cosmic Microwave Background 
Radiation is extraordinarily uniform over the whole sky with radiation temperature 
2.728 K. 

At a sensitivity level of about one part in 1000 of the total intensity, a large-scale 
anisotropy of dipolar form was observed over the whole sky (Fig. 2.2b). The plane 
of our Galaxy can also be observed as a faint band of emission along the Galactic 
equator. The global dipole anisotropy is naturally attributed to aberration effects 
associated with the Earth’s motion through an isotropic radiation field. Excluding 
regions close to the Galactic plane, the temperature distribution was found to have 
precisely the expected dipole distribution, T = 7o[l + ( v / c) cos 0], where 0 is the 
angle with respect to the direction of maximum intensity and v is the Earth’s velocity 
through the isotropic background radiation. The amplitude of the cosmic microwave 
dipole was 3.353 ± 0.024 mK with the maximum intensity in the direction towards 
galactic coordinates / = 264.25° ± 0.33°; b — 48.22° ±0.13° (Bennett et al., 1996). 
It was inferred that the Earth is moving at about 350 km s -1 with respect to the frame 
of reference in which the radiation would be 100% isotropic. It is significant that, 
although not designed to undertake this task, exactly the same form of large-scale 
dipole anisotropy was observed by the FIRAS instrument. 

The measurement of the velocity of the Sun relative to the Cosmic Microwave 
Background Radiation is an important result for understanding the large-scale dis- 
tribution of mass in the Universe. Once allowance is made for the motion of the Sun 
about the centre of our Galaxy, an estimate of the peculiar velocity of our Galaxy and 
the local group of galaxies relative to the frame of reference in which the background 
radiation would be perfectly isotropic can be found. This motion can be attributed to 
perturbations in the distribution of mass on very large scales in the relatively nearby 
Universe (Kolatt et al., 1995). 

On angular scales of 7° and greater, Bennett and his colleagues achieved sensi- 
tivity levels better than one part in 100,000 of the total intensity from analyses of 
the complete microwave data set obtained over the four years of the COBE mission 
(Fig. 2.2c). At this sensitivity level, the radiation from the plane of the Galaxy is 
intense, but is confined to a broad strip lying along the Galactic equator. Away from 
this region, the sky appears quite smooth on a large scale, but there are significant 
fluctuations in intensity from beamwidth to beamwidth over the sky. These fluctu- 
ations are present at the level of only about 1 part in 100,000 of the total intensity 
and, when averaged over the clear region of sky at \b\ > 20° amount to a root-mean- 
square amplitude of 35 ± 2 pK on an angular scale of 7°, or to 29 ± 1 pK when 
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C AT = 18 



Fig. 2.2a-c. Maps of the whole sky in galactic coordinates as observed at a wavelength of 
5.7 mm (53 GHz) by the COBE satellite at different sensitivity levels, a The distribution of total 
intensity over the sky. b Once the uniform component was removed, the dipole component 
associated with the motion of the Earth through the background radiation was observed, as 
well as a weak signal from the Galactic plane, c Once the dipole component was removed, 
radiation from the plane of the Galaxy was seen as a bright band across the centre of the 
picture. The fluctuations seen at high galactic latitudes were a combination of noise from 
the telescope and the instruments and a genuine cosmological signal. The rms value of the 
fluctuations at each point away from the Galactic equator amounted to 36 pK. When averaged 
statistically over the whole sky at high latitudes, an excess sky noise signal of cosmological 
origin of 30 ± 5 pK was detected (Bennett et al., 1996) 



smoothed to 10° angular scale. These values were found to be frequency independent 
for the three COBE frequency channels at 31.5, 53 and 90 GHz. The detection of 
these fluctuations is a crucial result for understanding the origin of the large-scale 
structure of the Universe. The COBE observations allow information to be obtained 
about the angular spectrum of the intensity fluctuations on all scales 0 > 7°. In 
Chap. 15, we will deal with the important cosmological information which can be 
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Fig. 2.3. A map of the whole sky in galactic coordinates as observed by the WMAP satellite 
at millimetre wavelengths (Bennett et al., 2003). The angular resolution of the map is about 
20 times higher than that of Fig. 2.2c. The emissions due to Galactic dust and synchrotron 
radiation have been subtracted from this map 



derived from observations of temperature fluctuations in the Cosmic Microwave 
Background Radiation on smaller angular scales. 

It is interesting to compare the COBE map (Fig. 2.2c) with the more recent 
WMAP observations of 2003 made with about 20 times higher angular resolution 
(Bennett et al., 2003) (Fig. 2.3). It can be seen that the same large scale features 
are present on both maps. In particular, regions of strong positive and negative 
fluctuations agree rather well. We will have a lot more to say about Fig. 2.3 in due 
course. 

The COBE observations are crucial for cosmology. From the point of view of 
the structure of the Universe on the very largest angular scales, they show that 
the Cosmic Microwave Background Radiation is isotropic to better than one part 
in 100,000. Whatever its origin, this observation in itself shows that the Universe 
must be extraordinarily isotropic on the large scale. As we will show, it is wholly 
convincing that this radiation is the cooled remnant of the very hot early phases of 
the Big Bang. 

How is the distribution of radiation related to the distribution of matter in the 
Universe? We will take up this topic in much more detail in Chap. 9, but it is useful to 
outline here how they are related. In the standard Big Bang picture, when the Universe 
was squashed to only about one thousandth of its present size, the temperature of 
the Cosmic Microwave Background Radiation was about one thousand times greater 
than it is now. The temperature of the background radiation varies with redshift z 
as T r = 2.728(1 + z) K and so, at a redshift z = 1500, the temperature of the 
radiation field was about 4000 K. At this temperature, there were sufficient Lyman 
continuum photons in the Wien region of the background spectrum to photoionise 
all the neutral hydrogen in the Universe. At this early epoch, known as the epoch 
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of recombination , galaxies had not formed and all the ordinary baryonic matter, 
which was eventually to become the visible matter of galaxies as we know them, 
was still in the form of remarkably smooth, partially ionised pre-galactic gas. At 
earlier epochs, the pre-galactic gas was fully ionised and was very strongly coupled 
to the background radiation by Thomson scattering. 

Therefore, when we look back to these epochs, it is as if we were looking at the 
surface of a star surrounding us in all directions, but the temperature of the radiation 
we observe has been cooled by a cosmological redshift factor of 1500, so that what we 
observe is redshifted into the millimetre waveband. This analogy makes it clear that, 
because of Thomson scattering of the background radiation, we can only observe the 
very surface layers of our ‘star’. We cannot obtain any direct information about what 
was going on at earlier epochs. This ‘surface’ at which the Universe became opaque 
to radiation is known as the last scattering surface and the fluctuations observed by 
COBE are interpreted as the very low intensity ripples present on that surface on 
angular scales of 7° and greater. These ripples grow under gravity and will eventually 
define some of the very largest scale structures in the local Universe. 

In the interpretation of the COBE observations described in the last paragraph, it 
was assumed that the intergalactic gas was transparent to radiation from the epoch of 
recombination onwards and was not reionised and heated at some later epoch. If that 
were to occur, the perturbations would be further damped by Thomson scattering 
and this has now been detected in the WMAP observations. However, the damping 
is not so great that features in the power spectrum of the fluctuations are wiped out. 

One important aspect of these studies is that the energy density of the Cosmic 
Microwave Background Radiation amounts to e ra d = uT 4 = 4.2 x 10 14 .1 m 3 = 
2.64 x 10 5 eV m 3 . This energy density of radiation pervades the whole Universe at 
the present epoch and provides by far the greatest contribution to the average energy 
density of the universal background radiation. 



2.2 The Large-Scale Distribution of Galaxies 

The visible Universe of galaxies is highly inhomogeneous, consisting of structures 
from the scale of isolated galaxies, through groups and clusters of galaxies to su- 
perclusters and giant voids in the distribution of galaxies. As we progress to larger 
and larger scales, the distribution of galaxies becomes smoother, but still contains 
significant non-random features. For many purposes, it is convenient to think of the 
galaxies as the building blocks of the Universe which define its large-scale structure. 

An excellent representation of the large-scale distribution of galaxies on the 
sky is shown in Fig. 2.4. This remarkable picture was created from scans of 185 
contiguous UK Schmidt plates, each of which covers an area of 6° x 6° on the 
sky, the scanning being carried out by the Cambridge APM high-speed measuring 
machine (Maddox et al., 1990). The image is centred on the South Galactic pole 
and so the effect of Galactic obscuration by dust on the distribution of galaxies is 
negligible. Each plate was carefully calibrated and stars distinguished from galaxies 
by their different image profiles. Figure 2.4 contains over two million galaxies with 
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Fig. 2.4. The distribution of galaxies with 17 < bj < 20.5 shown in an equal area projection 
centred on the South Galactic pole. This image was reconstructed from machine scans of 185 
UK Schmidt plates by the Cambridge APM measuring machine. There are over two million 
galaxies in this image. The small empty patches in the map are regions that have been excluded 
about bright stars, nearby dwarf galaxies, globular clusters and step wedges (Maddox et al., 
1990) 



apparent magnitudes in the range 17 <bj < 20.5 and so represents the distribution 
of galaxies on the sky on the grandest scale. 

It is apparent that, although one bit of Fig. 2.4 does not look too different from 
another on a large enough scale, the distribution of galaxies is far from uniform on 
a small scale. There appear to be clumps of galaxies, stringy structures and holes but, 
of course, the eye is expert at finding such structures in random data. Despite this 
concern, much of the obvious clumping, the clusters, holes and stringy structures 
are real features of our Universe. To demonstrate the reality of these features, the 
three-dimensional distribution of galaxies needs to be determined and so distances 
have to be measured for very large samples of galaxies. Although this is a really 
huge task, this has now been achieved thanks to the efforts of many astronomers. 

2.2.1 Two-Point Correlation Functions 

We need statistical methods appropriate for describing the clustering properties of 
galaxies on a wide range of scales and the simplest approach is to use two-point 
correlation functions. In the cosmological case, these can be described either in 
terms of the distribution of galaxies on the sky, or in terms of spatial two-point 
correlation functions in three dimensions. On the sky, we define the atigular two- 
point correlation function, w{6), by 



N{6) df2 = n g [l + w(9)] df2 , 



(2.4) 
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where w{0) describes the excess probability of finding a galaxy at an angular distance 
0 from any given galaxy. The term d T2 is the element of solid angle and n g is a suitable 
average surface density of galaxies. The term w{6) contains information about the 
clustering properties of galaxies to a given limiting apparent magnitude and can be 
measured with some precision from large statistical surveys of galaxies such as the 
Cambridge APM surveys which contain over two million galaxies (Fig. 2.4). Notice 
the important point that two-point correlation functions take circularly symmetric 
averages about each galaxy and so throw away all information about the ‘stringiness’ 
of the distribution of galaxies. Some of this more detailed structural information can 
be recovered using three- and four-point correlation functions, but let us begin with 
the two-point correlation functions. 

The homogeneity of the distribution of galaxies with increasing distance can be 
studied by measuring the angular two-point correlation function as a function of 
increasing apparent magnitude. If the galaxies are sampled from a homogeneous, 
but clustered, distribution, the angular two-point correlation function scales with 
increasing limiting distance D in local Euclidean space as 

D 0 ( D\ 

w(6,D) = -£w 0 le—\ , (2.5) 

where the function wq(6) has been determined to distance Dq. The factor 9(D/Dq) 
in the argument of wq takes account of the fact that a fixed scale subtends a smaller 
angle at a greater distance D and the factor ( Do/ D) in front of wq takes account 
of the fact that there are more background galaxies (oc Z) 3 ), but that the surface 
density of galaxies about any galaxy to a fixed physical scale increases only as D 2 . 
If the galaxies extend to distances D such that redshift effects need to be taken 
into account, it is necessary to integrate over the luminosity function of the galaxies 
counted and to use a Friedman world model to determine the spatial and surface 
number densities of galaxies (Groth and Peebles, 1977; Scranton et al., 2002). 

Such scaling analyses were carried out by Groth and Peebles who showed that 
the two-point correlation functions determined from a bright sample of Zwicky 
galaxies, from the Lick counts of galaxies and from a deep sky survey plate in an 
area known as the Jagellonian field scaled exactly as expected if the distribution 
of the galaxies displayed the same degree of spatial correlation throughout the 
local Universe out to z ~ 0.1 (Groth and Peebles, 1977, 1986). A similar result 
was found by comparing the two-point correlation functions found at increasing 
apparent magnitude limits in the machine-scanned surveys carried out by the APM 
group at Cambridge (Maddox et al., 1990). Figure 2.5a shows the angular two-point 
correlation functions w(6) measured at increasing apparent magnitude limits in the 
magnitude range 17.5 < m < 20.5. In Fig. 2.5b, these functions are scaled to the 
angular correlation function found from the Lick survey. 

More recently, the same type of analysis has been carried out for a large sample 
of galaxies from the Sloan Digital Sky Survey (SDSS) which extends to apparent 
magnitude r* = 23 by Connolly, Scranton and their colleagues. According to their 
estimates, the mean redshift in the magnitude interval 21 < r* < 22 is 0.43. Using 
the same scaling procedures with a proper Friedman cosmological model, they 
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Fig. 2.5a, b. The two-point correlation function for galaxies over a wide range of angular 
scales, a The scaling test for the homogeneity of the distribution of galaxies can be performed 
using the correlation functions for galaxies derived from the APM surveys at increasing 
limiting apparent magnitudes in the range 17.5 < m < 20.5. The correlation functions are 
displayed in intervals of 0.5 magnitudes, b The two-point correlation functions scaled to the 
correlation function derived from the Lick counts of galaxies (Maddox et al., 1990) 




Fig. 2.6a, b. The two-point correlation function for galaxies determined from the Sloan Digital 
Sky Survey (SDSS) (Connolly et al., 2002; Scranton et al., 2002). a The angular two-point 
correlation function determined in a preliminary analysis of 2% of the galaxy data contained 
in the Sloan Digital Sky Survey, b Comparison of the scaled angular two-point correlation 
functions found by Maddox and his colleagues from the APM galaxy survey ( solid line) with 
that found from the SDSS analysis 
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find excellent agreement with the angular two-point correlation function determined 
by Maddox and his colleagues (Fig. 2.6) (Connolly et al., 2002; Scranton et al., 
2002). It is noteworthy that a better match of the correlation functions is found for 
a concordance world model with L? () = 0.3 and Q a = 0.7 than for the critical model 
with Qq — 1 and £2 a — 0. 

These are important results for the construction of cosmological models. As 
expressed by Peebles (Peebles, 1993): 

. . . the correlation function analyses have yielded a new and positive test of 
the assumption that the galaxy space distribution is a stationary (statistically 
homogeneous) random process. 

Figures 2.5b and 2.6b illustrate the important point that the correlation function 
for galaxies is smooth, meaning that clustering is found on all angular scales with 
no prominent features on the scales of clusters or superclusters of galaxies. Of 
course, it needs to be remembered that the two-point correlation function is circularly 
symmetric about each galaxy and so wipes out a lot of detailed information. In 
Figs. 2.5b and 2.6b, w(6) can be characterised by a power law of the form 

w(ff) a <r (<) ■ 7 - 0 ■ 8, (2.6) 



with a cut-off on large angular scales. 

It is more meaningful physically to work in terms of the spatial two-point 
correlation function f(r) which describes the clustering properties of galaxies in 
three dimensions about any galaxy: 

N(r)dV = N 0 [l + $(r)]dV , (2.7) 



where N(r ) dV is the number of galaxies in the volume element dV at distance r 
from any galaxy and Nq is a suitable average space density, f (r) describes the excess 
number of galaxies at distance r from any given galaxy. 

In order to derive £(r) directly from observation, we need to know the distribution 
of galaxies in space. If, however, we make a number of reasonable assumptions, we 
can derive a simple formula which relates w(6) to §(r). Suppose a cluster of galaxies 
has radial number density distribution n{r) — «o[l + £(r)]. Then, it is a simple 
calculation to show that the projected distribution is given by the integral 
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(2.8) 



where a is the projected radial distance from the centre of the cluster and a max is 
the tidal radius of the cluster. The first term in the second integral for n(a) is just 
a constant. If we adopt a power law dependence for f (r), £(r) oc r~ y , the second term 
in the integral can be written in dimensionless form using the substitution r = ax as 
follows: 
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Allowing the upper limit of the integral to go to infinity, we see that the integral is 
a constant and that N(d) and £(r) are related by 

N(a) oc a~ (y ~ l) . (2.10) 

Consequently, in the region in which the angular two-point correlation function can 
be described by the power law relation (2.6), the function f (r) can be well represented 
by a power law of the form 



Sto = (£) V , (2.11) 

where y = 1.7 — 1.8. More detailed analyses of the relation between £(r) and w(0) 
show that the correlation function (2.11) is a good match to the data on physical 
scales from about 1 00/z 1 kpc to I Oh 1 Mpc in which the scale ro = 5/z 1 Mpc and 
the exponent y — 1.7 — 1.8. 2 On scales greater than about I 0/z 1 Mpc the two-point 
correlation function decreases more rapidly than the power law (2.11). Thus, on 
large enough scales, the amplitude of the clustering decreases dramatically and the 
Universe becomes isotropic on the very largest physical scales. 

We also obtain the important result that on physical scales r '^> 5/z 1 Mpc, the 
mean amplitude of the density perturbations is less than one and consequently density 
perturbations on larger scales are on average still in the linear regime Sq/q <$C 1 at 
the present epoch. 

2.2.2 Walls and Voids in the Distribution of Galaxies on Large Scales 

The analysis of Sect. 2.2.1 provides the simplest description of the distribution of 
galaxies on large scales, but it cannot describe the walls and voids in the distribution 
of galaxies seen in Fig. 2.4. The nature of these structures has been well defined by 
a number of large-scale redshift surveys for galaxies. 

One of the earliest complete samples of nearby galaxies is presented in Fig. 2.7 
which shows the local three-dimensional distribution of galaxies derived from the 
Flarvard-Smithsonian Astrophysical Observatory survey of over 14,000 bright galax- 
ies (Geller and Huchra, 1989). Our own Galaxy is located at the centre of the diagram 
and, if the galaxies were uniformly distributed in the local Universe, the points would 
be uniformly distributed over the diagram, which is certainly very far from the case. 
There are gross inhomogeneities and irregularities in the local Universe including 
large ‘holes’ or ‘voids’ in which the local number density of galaxies is signifi- 
cantly lower than the mean, and long ‘filaments’ or ‘walls’ of galaxies, including the 
feature known as the ‘Great Wall’, which extends from right ascensions 9 h to 17 h 

2 The use of h = 7/o/(100kms -1 Mpc -1 ) is a convenient device for adjusting the dimen- 
sions and luminosities of extragalactic objects to the reader’s preferred value of Hub- 
ble’s constant. If a value of Hq = 100 kms -1 Mpc -1 is preferred, h = 1; if the value 
Hq — 50 kms -1 Mpc -1 is adopted, h = 0.5 and so on. It is now known that the value of 
h is h = 0.72 ± 0.07. 
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Fig. 2.7. The distribution of galaxies in the nearby Universe as derived from the Harvard- 
Smithsonian Center for Astrophysics survey of galaxies. The map contains over 14,000 
galaxies which form a complete statistical sample around the sky between declinations 
S = 8.5° and 44.5°. All the galaxies have recession velocities less than 15,000kms _1 . 
Our Galaxy is located at the centre of the map and the radius of the bounding circle is 
150/t -1 Mpc. The galaxies within this slice have been projected onto a plane to show the 
large-scale features in the distribution of galaxies. Rich clusters of galaxies which are 
gravitationally bound systems with internal velocity dispersions of about 10 3 kms -1 appear 
as ‘fingers’ pointing radially towards our Galaxy at the centre of the diagram. The distribution 
of galaxies is highly irregular with huge holes, filaments and clusters of galaxies throughout 
the local Universe (Geller and Huchra, 1989) 

about half-way to the limit of the survey. There are a number of ‘streaks’ or ‘fingers’ 
pointing towards our own Galaxy and these correspond to clusters of galaxies, the 
lengths of the ‘streaks’ corresponding to the components of the velocity dispersion 
of the galaxies in the clusters along the line of sight. 
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Fig. 2.8a,b. The spatial distribution of galaxies on a large-scale. In both diagrams, the dis- 
tribution extends to a redshift z & 0.25. a A slice through the Anglo-Australian Telescope 
2dF Galaxy Survey (Colless et al., 2001) showing the pronounced ‘cellular’ structure of the 
distribution of galaxies on the large scale (image courtesy of the 2dFGRS Team), b The 
distribution of galaxies in the Sloan Digital Sky Survey, showing the same ‘cellular’ structure 
observed in the AAT 2dF survey (Stoughton et al., 2002) 
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These surveys have now been extended to much greater distances and the same 
overall picture emerges. Among the first of these surveys, the Las Campanas Redshift 
Survey sampled the distribution of galaxies to a distance about four times that of the 
Harvard-Smithsonian survey and included 26,418 galaxies (Lin et al., 1996). The 
sizes of the voids in the galaxy distribution were on roughly the same physical scale 
as those in Fig. 2.7, indicating that the Universe is homogeneous on a large enough 
scale, consistent with the scaling arguments from the angular two-point correlation 
functions. 

More recently, the statistics of such surveys has been increased by an order 
of magnitude by the Anglo- Australian Telescope Two-degree Field (2dF) survey of 
galaxies and the first results of the Sloan Digital Sky Galaxy Survey (Fig. 2.8a and b). 
In the 2dF survey, the redshifts of over 200,000 galaxies were measured. To reveal the 
‘cellular’ structure more clearly, only a narrow wedge 4° wide is shown in Fig. 2.8a 
which includes 56,237 galaxies (Colless et al., 2001). The map shown in Fig. 2.8b 
from the Sloan galaxy survey also includes over 200,000 galaxies (Stoughton et al., 
2002). In both cases, the surveys extend to redshifts of about 0.25 and it can be seen 
that the ‘cellular structure persists out to the limits of the surveys. Notice that this 
means that these surveys have already mapped out a large fraction of the distribution 
of galaxies at the present epoch. 

In Figs. 2.7 and 2.8a and b, the scales of the largest holes are about 30-50 times 
the scale of a cluster of galaxies, that is, up to about 50 h~ l Mpc. These are the largest 
known structures in the Universe and one of the major cosmological challenges is 
to reconcile this gross irregularity in the large-scale spatial distribution of galaxies 
with the remarkable smoothness of the Cosmic Microwave Background Radiation 
seen in Fig. 2.2c. Despite the presence of the huge voids however, the amplitude of 
these irregularities decreases with increasing scale so that on the very largest scales, 
one bit of Universe looks very much like another. 

It is important to have a quantitative description of the large-scale topology of 
the galaxy distributions shown in Figs. 2.7 and 2.8a and b. In the 1980s, Gott and 
his colleagues developed techniques for evaluating the topology of the distribution 
of voids and galaxies from large redshift surveys (Gott et al., 1986; Melott et al., 
1988). As they expressed the issue delightfully in their paper: 

We would like to know whether the distribution of galaxies on large scales 
is best described as a hierarchy of clusters, an irregular lattice of cells or 
‘bubbles’, a network of filaments, or a set of non-intersecting filaments. 
Loosely speaking, one might describe the topology of these models as 
respectively a ‘meatball’ topology, a ‘swiss-cheese’ topology, a ‘sponge’ 
topology and a ‘spaghetti’ topology. 

The importance of these studies is that the topology of the distribution of galaxies 
is intimately related to the initial conditions from which the large-scale structure 
formed, in particular, to the common assumption that the perturbations were Gaussian 
fluctuations with random phases. 

From an analysis of the CfA galaxy survey, Gott and his colleagues found that 
the distribution of the galaxies on the large scale is ‘sponge-like’, the material of 



42 



2 The Large-Scale Structure of the Universe 



the sponge representing the location of the galaxies and the holes in the sponge 
corresponding to the large voids (Gott et al., 1986; Melott et al., 1988). Both the 
holes and the distribution of galaxies are continuously connected throughout the 
local Universe. This topology is possible in three dimensions but not in two. Just 
like a sponge, overall the distribution of material of the sponge and the holes is 
homogeneous, but, on a small scale, it is highly inhomogeneous. 

Similar analyses have been carried out for the large AAT 2dF and SDSS Galaxy 
surveys by Hoyle, Vogeley, Gott and their colleagues (Hoyle et al., 2002a, b). They 
find that the overall topology of the two independent surveys are in remarkable 
agreement and similar to that of a Gaussian random field. They go further and show 
how further astrophysically important issues can be addressed by considering the 
topologies of red and blue galaxies separately and comparing these with simulations 
of the formation of structure in the concordance ACDM model. We will return to 
these issues in Part IV. 

Another way of investigating the large-scale distribution of discrete objects in 
the Universe is to study the distribution of extragalactic radio sources over the 
sky. Unlike the optical waveband, it turns out that, when a survey of the radio sky 
is made, the objects which are easiest to observe are extragalactic radio sources 
associated with certain rare classes of active galaxy, the radio quasars and radio 
galaxies, at very great distances. Because they are rare objects, they sample the 
isotropy of the Universe on a very large scale. Figure 2.9 shows the distribution 
of the brightest 54,579 extragalactic radio sources at a wavelength of 6 cm in the 
Greenbank Catalogue of radio sources which spans most of the northern hemisphere 
(Gregory and Condon, 1991; Kooiman et al., 1995). Besides the hole about the North 
Celestial pole, there are holes in the vicinity of a few intense radio sources. There 
is also a small excess of sources lying along the Galactic plane but otherwise the 
distribution is entirely consistent with the sources being distributed uniformly at 
random over the sky on the large scale. 

Kooiman and his colleagues report weak clustering on angular scales 1-2°, 
consistent with an angular two-point correlation function of the form w(6) oc 0 0-8 . 
A much stronger angular two-point correlation signal was found in the much deeper 
FIRST survey carried out with the VLA by Helfand and his colleagues (Cress et al., 
1996). At a limiting flux density of 1 mJy at 1.4 GHz, 138,665 radio sources were 
detected and an angular two-point correlation function of the form w(6) oc 0 u 
measured down to angular separations of a tenth of a degree. 

The radio sources are ideal for probing the large-scale distribution of discrete ob- 
jects since they are readily observed at large distances. The bulk of the radio sources 
plotted in Fig. 2.9 lie at redshifts z > 1 and so they sample the distribution of discrete 
sources on the largest physical scales accessible to us at the present epoch. Notice 
that the extragalactic radio sources provide complementary information to that pro- 
vided by the Cosmic Microwave Background Radiation, in that they refer to the 
large-scale distribution of discrete objects, such as galaxies, once they have formed. 

On fine scales, the clustering of galaxies takes place on a very wide variety 
of scales from pairs and small groups of galaxies, such as the Local Group of 
galaxies, to giant clusters of galaxies, such as the Coma and Pavo clusters which can 
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Fig. 2.9. The distribution of radio sources in the Greenbank Catalogue of radio sources at 6 cm 
(Gregory and Condon, 1991). The picture includes 54,579 radio sources with angular sizes 
less than 10.5 arcmin and flux densities 5 > 25 mJ. The area about the North Celestial pole 
was not surveyed. There are ‘holes’ in the distribution about a few bright radio sources and 
a small excess of sources associated with the Galactic plane. Otherwise, the distribution does 
not display any significant departure from a random distribution on a large scale, although 
there is evidence for clustering on the scale of 1° (Kooiman et al., 1995) 



contain thousands of members - we discuss some of their properties in Chap. 4. The 
rich regular clusters are self-gravitating bound systems, but there are also irregular 
clusters which have an irregular, extended appearance and it is not so clear that these 
are bound systems. 

The term supercluster is used to describe structures on scales larger than those 
of clusters of galaxies. They may consist of associations of clusters of galaxies, 
or a rich cluster with associated groups and an extended distribution of galaxies. 
Some authors would classify the ‘stringy’ structures seen in Figs. 2.7 and 2.8 as 
superclusters, or supercluster cells. From the physical point of view, the distinction 
between the clusters and the superclusters is whether or not they are gravitationally 
bound. Even in the rich, regular clusters of galaxies, which have had time to relax 
to a state of dynamical equilibrium, there has only been time for individual galaxies 
to cross the cluster up to about 10 times in the age of the Universe and so, on larger 
scales, there is scarcely time for the systems to become gravitationally bound. Our 
own Galaxy and the Local Group of galaxies are members of what is known as the 
Local Supercluster. This is the huge flattened distribution of galaxies centred on the 
Virgo cluster, which lies at a distance of about 15-20Mpc from our own Galaxy. It 
can be seen very prominently in maps of the distribution of bright galaxies running 
more or less perpendicular to the plane of the Galaxy and is outlined by filled circles 
in Fig. 2.10 (Kolatt et al., 1995). 



44 



2 The Large-Scale Structure of the Universe 




Fig. 2.10. An equal area projection drawn in galactic coordinates of the distribution of bright 
galaxies over the whole sky. In the north, the galaxies are taken from the UGC catalogue 
and, in the south, from the ESO, UGC and MCG catalogues. There is an absence of galaxies 
at low galactic latitudes because of extinction by interstellar dust. The prominent band of 
galaxies intersecting the Galactic plane at right angles at / ~ 320° is the Local Supercluster 
of Galaxies and the Supergalactic plane is delineated by the filled circles. Some regions in 
which prominent clustering of galaxies is found are labelled by the constellations in which 
these lie (Kolatt et al., 1995) 



We conclude that, on the very largest scales, the distribution of matter and 
radiation is remarkably isotropic and homogeneous. This greatly simplifies the con- 
struction of cosmological models. 



2.3 Hubble’s Law and the Expansion of the Universe 

Hubble made his great discovery of the velocity-distance relation for galaxies in 
1929 (Hubble, 1929). A modern version of Hubble’s law, in the form of an redshift- 
apparent magnitude relation or Hubble diagram, is shown in Fig. 2. 1 1 for the brightest 
galaxies in clusters (Sandage, 1968). It is found empirically that the brightest galaxies 
in nearby clusters all have more or less the same intrinsic luminosities and so their 
apparent magnitudes can be used to estimate relative distances by application of the 
inverse square law. 

For a class of galaxy of fixed intrinsic luminosity L, the observed flux density S 
is given by the inverse square law, S = L /4jtr 2 and so, converting this relation into 
astronomical apparent magnitudes m using the standard relation in — constant — 
2.5 log 10 S, it follows that 



m — 5 log 10 r + constant . 



(2.12) 
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Fig. 2.11. A modem version of the velocity-distance relation for galaxies for the brightest 
galaxies in rich clusters of galaxies. This correlation indicates that the brightest galaxies in 
clusters have remarkably standard properties and that their velocities of recession from our 
own Galaxy are proportional to their distances (Sandage, 1968) 



This was the approach adopted by Hubble and Humason in their pioneering analysis 
of 1934 (Hubble and Humason, 1934) - they assumed that the 5th brightest galaxy 
in a cluster would have more or less the same intrinsic luminosity (Fig. 1.5b). In 
Fig. 2.11, the corrected apparent magnitude in the V waveband is plotted against 
the logarithm of the redshift of the brightest galaxies in a number of rich clusters 
of galaxies which span a wide range of redshifts. The redshift z is defined by the 
formula 



z = 



7-obs k c 
^em 



(2.13) 



where A em is the emitted wavelength of some spectral feature and L 0 b S is the wave- 
length at which is it observed. In the limit of small velocities, v <$C c, if the redshift 
is interpreted in terms of a recessional velocity v of the galaxy, v — cz and this 
is the type of velocity plotted in the velocity-distance relation. It is an unfortunate 
tradition in optical astronomy that the splendidly dimensionless quantity, the redshift 
Z, is converted into a velocity by multiplying it by the speed of light. As we will see 
below, interpreting the redshift in terms of a recessional velocity leads to confusion 
and misunderstanding of its real meaning in cosmology. It is best if all mention of 
recessional velocities are expunged in developing the framework of cosmological 
models. 



46 



2 The Large-Scale Structure of the Universe 



The solid line shown in Fig. 2.11 is m — 51og 10 z + constant, corresponding 
to v oc r, and it runs precisely through the observed points - correlations do not 
come any better than this in cosmology. The velocity-distance relation is normally 
written v — liar, where Hq is known as Hubble’s constant. The velocity-distance 
relation appears to hold good for all classes of extragalactic system, including the 
active galaxies and quasars. 

We discussed in detail recent evidence for the homogeneity of the distribution of 
galaxies in space in Sect. 2.2. In fact, Hubble realised in the 1930s that a simple test 
of the homogeneity of the Universe is provided by the number counts of galaxies. 
As we will show ins Sect. 17.2.1, in a homogeneous Universe it is expected that the 
number counts of galaxies follow the law N(> S ) oc S 3/2 , where S is the observed 
flux density of the galaxy. This result is independent of the luminosity function of the 
sources so long as the counts do not extend to such large distances that the effects 
of the cosmological redshift have to be taken into account. In terms of apparent 
magnitudes, this relation becomes N(< m) oc I O 0 6 "' . 

Hubble found that the counts of galaxies to about 20th magnitude more or 
less followed this relation, although they showed some convergence at the faintest 
apparent magnitudes, which Hubble interpreted as evidence for the effects of space 
curvature at large distances (Hubble, 1936). More recent counts of galaxies are shown 
in Fig. 2.12 which extend to very faint apparent magnitudes (Metcalfe et al., 1996). 
The results are similar to those of Hubble at m < 20; the counts are slightly flatter 
than the Euclidean predictions, but are entirely consistent with the expectations of 
uniform world models once the effects of observing the populations at significant 
cosmological distances are taken into account. Divergences from the expectations 
of the uniform models occur at much fainter blue apparent magnitudes ( B > 22), in 
the sense that there is an excess of faint blue galaxies; we will take up the origin of 
this excess in detail in Chap. 17. 

The combination of the observed large-scale isotropy and homogeneity of the 
Universe with Hubble’s law shows that the Universe as a whole is expanding uni- 
formly at the present time. Let us show this formally by the following simple 
calculation. Consider a uniformly expanding system of points (Fig. 2.13). The def- 
inition of a uniform expansion is that the distances between any two points should 
increase by the same factor in a given time interval, that is, we require 



n (U) _ r 2 (t 2 ) 
r\(t i) r 2 (t\) 



r n (t 2 ) 

= ■ ■ ■ = a = constant , (2.14) 

r n (h ) 



for any set of points. Let us select some galaxy at random and take the distances of 
the other galaxies from it to be n, r 2 , . . . . Then, the recession velocity of galaxy 1 
relative to the chosen origin is 



_ nfe) - n(fi) _ n(h) r n fe ) 
h-h t 2 - h\_r x {ti) 

r\ (u) 

= — (a - 1) = H 0 r\(ti) . 

12 ~ 1 1 



(2.15) 
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Magnitude 

Fig. 2.12. The counts of faint galaxies observed in the B, I and K wavebands compared with the 
expectations of various uniform world models, as well as other models in which various forms 
of the evolution of the luminosity function of galaxies with redshift are assumed (Metcalfe 
et al., 1996). The galaxy counts follow closely the expectations of uniform world models at 
magnitudes less than about 22, but there is a excess of galaxies in the B and I wavebands at 
fainter magnitudes 



Similarly, for the nth galaxy, 

v n = - 1) = . (2.16) 

h — 1 1 

Thus, a uniformly expanding distribution of galaxies automatically results in 
a velocity-distance relation of the form v oc r. 

This analysis is, however, much deeper than simply an explanation of the local 
velocity-distance relation. Notice that the above analysis applies to galaxies at all 
distances in a uniformly expanding universe. The requirements of isotropy and 
homogeneity mean that the same linear velocity-distance relation must hold true 
at all distances , including at distances at which the recession velocities exceed the 
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Fig. 2.13. Illustrating the origin of the velocity-distance relation for an isotropically expanding 
distribution of galaxies. The distribution of galaxies expands uniformly between the epochs 
t\ and t 2 - If, for example, we consider the motions of the galaxies relative to the galaxy A, 
it can be seen that galaxy C travels twice as far as galaxy B between the epochs t\ and tj 
and so has twice the recession velocity of galaxy B relative to A. Since C is always twice the 
distance of B from A, it can be seen that the velocity-distance relation is a general property 
of isotropically expanding universes 



speed of light. A good way of understanding this assertion is to consider the familiar 
example of the surface of an expanding balloon as an analogue for the kinematics of 
an expanding universe. As the balloon is blown up, the distances between points on 
the sphere increase and everywhere a velocity-distance relation is obtained locally. 
When the balloon is blown up to a huge size, widely separated points on the sphere 
separate at speeds which can be greater than the speed of light. There is, however, 
nothing unphysical about this since there is no causal connection between the points. 
The points are simply partaking in a geometrically uniformly expansion which takes 
place over enormous distances. We will return to this important point in Chap. 12. 



2.4 Conclusion 

The upshot of the considerations of this chapter is that the correct starting point 
for the construction of models for the large-scale dynamics of the Universe is that 
they should be isotropic and homogenous on the large scale and that they should be 
uniformly expanding. These are enormous simplifications and, taken in conjunction 
with the General Theory of Relativity, provide a set of simple world models which 
provide the framework within which we can study the problems of the origin of 
galaxies and the other large-scale structures we observe in the Universe today. 



3 Galaxies 



3.1 Introduction 

‘Galaxy formation’ is the title of this book and so we should summarise what 
is known about the properties of galaxies. This is not a trivial business because 
normal galaxies are complex, many-body systems. Typically, a galaxy can consist 
of hundreds of millions or billions of stars, it can contain considerable quantities 
of interstellar gas and dust and can be subject to environmental influences through 
interactions with other galaxies and with the intergalactic gas. Star formation takes 
place in dense regions of the interstellar gas. To complicate matters further, it is 
certain that dark matter is present in galaxies and in clusters of galaxies and the 
dynamics of galaxies are largely dominated by this invisible dark component. Its 
nature is, however, unknown. 

Until the advent of the massive surveys of galaxies, the 2dF Galaxy Survey 
undertaken by the Anglo-Australian Telescope and the Sloan Digital Sky Survey 
(SDSS), the typical properties of galaxies were defined by meticulous morphological 
studies of large samples of bright galaxies. Being bright and relatively nearby, the 
morphological classification schemes had to encompass a vast amount of detail and 
this was reflected in Hubble’s pioneering studies as elaborated by de Vaucouleurs, 
Kormendy, Sandage, van den Bergh and others. In contrast, the sheer size of the 
new galaxy samples, which each encompass about 200,000 galaxies, has meant that 
classification schemes had to be based upon parameters which could be derived from 
computer analysis of the galaxy images and spectra. What the new approach loses in 
detail, it more than makes up for in the huge statistics involved and in the objective 
nature of the classification procedures. 

These recent developments have changed the complexion of the description of 
the properties of galaxies. While the new samples provide basic global information 
about the properties of galaxies, the old schemes describe many features which 
need to be incorporated into the understanding of the detailed evolution and internal 
dynamics of particular classes of galaxy. The upshot is that, we need to develop 
in parallel both the traditional and more recent approaches to the classification of 
galaxies. We will summarise some of their more important properties, as well as 
elucidating some of the essential physics. In the next chapter, we will perform 
a similar exercise for clusters of galaxies. The books Galaxies in the Universe: an 
Introduction by Sparke and Gallagher, Galactic Astronomy by Binney and Merrifield 
and Galactic Dynamics by Binney and Tremaine can be thoroughly recommended 
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as much more thorough introductions to these topics (Sparke and Gallagher, 2000; 
Binney and Merrifield, 1998; Binney and Tremaine, 1987). The results of the 2dF 
Galaxy Survey and the Sloan Digital Sky Survey (SDSS) are too recent to have 
entered the textbooks. We begin with the traditional approach and then relate these 
studies to the more global approach adopted in recent analyses. 



3.2 The Revised Hubble Sequence for Galaxies 

Let us first describe the traditional approach to the classification of galaxies, noting 
points of contact with more recent statistical approaches. Galaxies come in a be- 
wildering variety of different shapes and forms. In order to put some order into 
this diversity, classification schemes were devised on the basis of their visual ap- 
pearances, or morphologies, originally on photographic plates but nowadays from 
digital images taken with CCD cameras. The basis of the traditional morphological 
schemes remains the Hubble Sequence of Galaxies , described in Hubble’s mono- 
graph The Realm of the Nebulae (Hubble, 1936). The Hubble sequence, sometimes 
referred to as a ‘tuning-fork’ diagram, arranges galaxies into a continuous sequence 
of types with elliptical galaxies at the left-hand end and spirals at the right-hand end 
(Fig. 3.1). The spiral galaxies are ordered into two branches named ‘normal’ and 
‘barred’ spirals. Conventionally, galaxies towards the left-hand end of the sequence 
are referred to as ‘early-type’ galaxies and those towards the right as ‘late-type’ 
galaxies, reflecting Hubble’s original prejudice concerning their evolution from one 
type to another. Despite the fact that these ideas have long outlived their usefulness, 
the terms are still in common use, even in the era of the massive surveys of galaxies. 

Morphological classification schemes such as the Hubble sequence become an 
integral part of astrophysics when independent properties of galaxies are found to 
correlate with the morphological classes. This has been found to be the case for 




Fig. 3.1. The Hubble sequence of galaxies as presented in The Realm of the Nebulae (Hubble, 
1936) 
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a number of the overall properties of galaxies such as their integrated colours, the 
fraction of the mass of the galaxy in the form of neutral and molecular gas, and so on 
(Sect. 3.8). The vast majority of galaxies can be accommodated within the revised 
Hubble Sequence of Galaxies, which is described in detail by de Vaucouleurs, 
Sandage, Kormendy and van den Bergh (de Vaucouleurs, 1974; Sandage, 1975; 
Kormendy, 1982; van den Bergh, 1998). Van den Bergh emphasised that the classical 
Hubble types refer primarily to intrinsically luminous galaxies and that, in addition, 
there exists a large population of intrinsically low luminosity dwarf galaxies which 
can only be observed relatively nearby. There are also various other categories of 
galaxies with special characteristics, for example, the Seyfert galaxies, cD galaxies, 
N galaxies, radio galaxies, starburst galaxies and so on. Many of these types of 
galaxy contain active galactic nuclei. 

Elliptical galaxies E. These galaxies show no structural features in their brightness 
distributions but have an elliptical appearance, as if they were spheroids or ellipsoids 
of revolution (Fig. 3.2a). In absolute magnitude, elliptical galaxies range from among 
the most luminous galaxies known, having Mb —24, to dwarf ellipticals (dE), 
which are found in the Local Group of galaxies. In Hubble’s notation, the observed 
ellipticity of the galaxy is included in the morphological designation according to 
the rule that the number 10 x (a — b)/a was written after the letter E, where a 
and b are the observed major and minor axes of the ellipse. Thus E0 galaxies are 
circular and E7 galaxies, the most extreme ellipticities found in elliptical galaxies, 
have b/a — 0.3. Galaxies flatter than E7 all show a distinct disc and bulge structure 
and hence are classified as lenticular (SO) rather than E galaxies. 

Spiral galaxies S, SA, SB. The characteristic feature of spiral galaxies is their disk- 
like appearance with well-defined spiral arms emanating from their central regions 
(Fig. 3.2b and 3.3a). Very often the spiral pattern is double with a remarkable degree 
of symmetry with respect to the centre of the galaxy, but many more complicated 
configurations of spiral structure are known. The light distribution of what Hubble 
termed ‘normal’ spiral galaxies (or SA galaxies) can be decomposed into a central 
bulge or spheroidal component, similar in character to an elliptical galaxy, and a disk 
component, within which the spiral arms lie. In the case of the barred spirals (or 
SB galaxies), the central bulge has an elongated or ellipsoidal appearance, the spiral 
arms originating from the ends of the bar (Fig. 3.3a). There are as many ‘barred’ 
spiral galaxies as ‘normal’ spirals and, furthermore, there are just as many spirals 
intermediate between these two classes. 

Spiral galaxies are classified as Sa, Sb, Sc according to the following criteria, in 
decreasing order of importance; (1) the openness of the winding of the spiral arms, 
(2) the degree of resolution of the arms into stars and (3) the size of the spheroidal 
component or central bar relative to the disk component. Thus, 

- Sa galaxies have tightly wound spiral arms which are smooth showing no reso- 
lution into stars. The central bulge or bar is dominant, shows no structure and is 
unresolved into star clusters. 

- Sb galaxies have more open spiral arms, which show resolution into stars. The 
central spheroidal component or bar is generally smaller than in Sa galaxies. 
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Fig. 3.2a, b. Examples of different types of normal galaxy, a The elliptical galaxy M87 (NGC 
4486), one of the most luminous galaxies in the nearly Virgo cluster of galaxies. This image 
shows its famous optical jet originating in the bright nucleus of the galaxy which contains 
a black hole of mass M ~ 10 9 M 0 



- Sc galaxies have very open spiral arms which are patchy and are resolved into 
star clusters and regions of ionised hydrogen. The spheroidal component is very 
small. In barred spiral galaxies, the bar is resolved into clusters and HII regions 
and is not as prominent as in classes Sa or Sb. 

- The revised Hubble scheme extends this classification beyond Sc to include 
‘nearly chaotic’ structures which would have been classified as very late Sc 
spirals in the standard sequence but are now as classified Sd spirals. 

These morphological classes are rather broad and intermediate stages along the 
sequence are defined as Sab, She and Scd. It is found that, within a given class of 
spiral galaxy, the importance of the bulge can vary considerably. It is interesting 
to compare this approach with the more recent statistical approach in which the 
bulge-to-disc ratio is one of the key measurable parameters. 




3.2 The Revised Hubble Sequence for Galaxies 



53 




Fig. 3.2. (continued) b The spiral galaxy M51 (NGC 5194) and its nearby dusty companion 
(NGC 5195). (Images courtesy of NASA, ESA and the Hubble Heritage Team (STScI/AURA)) 



Lenticular galaxies SO or L. All galaxies with smooth light distributions and axial 
ratios b/a < 0.3 show evidence of a disk-like component and these are called 
lenticular (lens-like) or SO galaxies (Fig. 3.3b). They are similar to spiral galaxies 
in that their light distributions can be decomposed into a central bulge, similar in 
properties to elliptical galaxies, and an extensive disk. The lenticular galaxies appear 
intermediate in morphological type between elliptical and spiral galaxies. 

In many cases, the central bulges of the SO galaxies have a bar-like appearance 
and hence, as in the case of the spirals, they can be divided into ‘ordinary’ and 
‘barred’ lenticulars as well as intermediate types. In a number of lenticular galaxies, 
there is evidence for obscuring matter, often in the form of rings as can be seen in the 
example of NGC 1300 in Fig. 3.3b. In the revised Hubble classification, lenticular 
galaxies which are free of obscuring matter are termed ‘early’ S0~ with stages SO 0 
and S0 + representing ‘later’ stages with increasing amounts of obscuring material. 
By the intermediate stage between lenticular and spiral galaxies, SO/a, the obscuring 
matter begins to show what is referred to as ‘incipient spiral structure’. 

Irregular galaxies. In Hubble’s original classification, irregular galaxies were sys- 
tems Tacking both dominating nuclei and rotational symmetry’ and the class in- 
cluded everything which could not be readily incorporated into the standard Hubble 
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Fig. 3.3a, b. Examples of different types of normal galaxy, a The barred spiral galaxy NGC 
1300 



sequence. Many of these irregulars were similar to the companion galaxies of our 
own Galaxy, the Magellanic Clouds, and these were designated Irr I or Magellanic 
irregulars. There remained a small class of irregulars consisting of galaxies such as 
M82 , NGC 520 and NGC 3077, in which there was no evidence of resolution into 
stars; these galaxies were classified Irr II galaxies. 

Evidence that the Irr I galaxies form a natural extension of the Hubble sequence 
was provided by de Vaucouleurs’ discovery of weak but definite spiral structure in 
the Large Magellanic Cloud (LMC). Galaxies like the LMC can be considered to 
belong to stages in the Hubble sequence later than Sd and are denoted Sm. Thus, the 
late stages of the sequence reads: Scd, Sd, Sdm, Sm, Im. The Irr II systems find no 
natural place in the revised sequence and are designated 10 by de Vaucouleurs. The 
characteristics of the 10 irregular galaxies are that they are very rich in interstellar 
matter and contain young stars and active regions of star formation; a number of 
these would be classified as starburst galaxies. 

In the revised Hubble sequence, shown in tabular form in Table 3.1, the vari- 
ous stages along the sequence are assigned numbers ranging from —6 to 11. All 
transitions along the sequence are smooth and continuous. The frequencies with 
which different types of galaxy are found among catalogues of bright galaxies are 
shown in Tables 3.2 and 3.3 (de Vaucouleurs 1963). Striking features of this table 
are the large percentage of lenticular galaxies and the roughly equal proportions of 
normal, barred and intermediate spiral galaxies. The latter statistics indicate that, in 
well over half the known examples of spiral galaxies, there are bar-like structures 
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Fig. 3.3. (continued) b The SBO or lenticular galaxy NGC 2787. (Images courtesy of NASA, 
ESA and the Hubble Heritage Team (STScI/AURA)) 



Table 3.1. The revised Hubble sequence of galaxies according to de Vaucouleurs" classification 
(de Vaucouleurs, 1974) 



-6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8 9 10 11 

E" E° E+ SO” SO 0 S0+ SOa Sa Sab Sb Sbc Sc Scd Sd Sdm Sm Im 10 



in their central regions and this has important implications for the origin of spiral 
structure. 

The figures given in Tables 3.2 and 3.3 must be treated with considerable caution. 
First of all, the galaxies included in the above statistics are those present in bright 
galaxy catalogues. They therefore refer to objects of a very wide range of intrinsic 
luminosities, from among the most luminous galaxies known, such as M87, to nearby 
dwarf galaxies. Secondly, as mentioned above, the classical Hubble types refer 
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Table 3.2. The frequencies with which galaxies of different morphological types are found 
among samples of bright galaxies (de Vaucouleurs, 1963) 



Class of Galaxy 


E 


L 


S 


Im 


10 


Pec 


Total 


Number 


199 


329 


934 


39 


13 


14 


1528 


Percentage 


13.0 


21.5 


61.1 


2.55 


0.85 


0.9 


100 



Table 3.3. The frequencies of different subtypes among 994 spiral galaxies (de Vaucouleurs, 
1963) 



Class 
of Galaxy 


0/a 


a 


ab 


b 


be 


c 


cd 


d 


dm 


m 


? 


Total 


Percentage 


SA 


17 


25 


25 


57 


57 


82 


30 


9 


3 


4 


2 


311 


31.3 


SAB 


13 


15 


23 


45 


50 


71 


35 


11 


3 


7 


1 


274 


27.6 


SB 


26 


43 


33 


83 


27 


55 


27 


28 


9 


30 


10 


366 


36.8 


S 


4 


1 


0 


6 


1 


13 


1 


10 


0 


0 


7 


43 


4.3 



almost exclusively to intrinsically luminous galaxies. Thirdly, the above statistics 
include galaxies belonging to the general field, to weak groups and to rich clusters 
of galaxies, but the fractions of the different morphological types vary with the 
local galaxy number density. In an important paper, Dressier plotted the frequency 
of different galaxy types as a function of the number density of galaxies in which 
they are found (Fig. 3.4) (Dressier, 1980). Field galaxies, that is, galaxies which are 
not members of groups or clusters of galaxies, are located towards the left of the 
diagram, while rich clusters of galaxies are towards the right. It can be seen that, in 
rich clusters, the elliptical and SO galaxies are much more common than the spiral 
galaxies, whereas in the general field, the majority of galaxies are spirals. Evidently, 
the environment in which a galaxy finds itself is correlated with its morphological 
characteristics. These relations will be quantified in more detail in Sect. 3.9.4. 



3.3 Peculiar and Interacting Galaxies 

The revised Hubble classification can encompass the forms of virtually all galaxies. 
There are, however, a number of galaxies with very strange appearances and these 
are referred to collectively as peculiar galaxies. Arp published his Atlas of Peculiar 
Galaxies in 1966 and a corresponding catalogue for the Southern Hemisphere 1987 
(Arp, 1966; Arp et al., 1987). As he remarked in the introduction to the first Atlas: 

The greatest deviations from the normal are emphasised in this atlas. 

A few galaxies are known, for example, in which the stellar component is in the form 
of a ring rather than a disc or spheroid, the Cartwheel being a beautiful example of 
this type of galaxy (Fig. 3.5); these are known as ring galaxies. 
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Fig. 3.4. The fractions of different morphological types of galaxy found in different galaxy 
environments. The local number density of galaxies is given as a projected surface density, 
Pproj °f galaxies, that is, numbers Mpc -2 (Dressier, 1980) 



Most of these remarkable structures are due to strong gravitational interactions, or 
collisions, between galaxies. In the early 1970s, Toomre and Toomre carried out pi- 
oneering computer simulations of close encounters between galaxies which showed 
how such events could give rise to remarkable asymmetric structures (Toomre and 
Toomre, 1972). In Fig. 3.6, a deep image of the pair of interacting galaxies known 
as the Antennae is shown, revealing the extraordinary long ‘tails’ which seem to be 
emanating from a pair of closely interacting spiral galaxies in which a great deal of 
recent star formation has occurred. 

The Toomres showed how such elongated ‘tails’ could be accounted for by 
a gravitational interaction between two spiral galaxies. In the simulation shown 
in Fig. 3.7, the two spiral galaxies pass close to each other on prograde orbits, 
that is, the rotational axes of the two discs are parallel and also parallel to the 
rotational axis of the two galaxies about their common centre of mass. The spiral 
galaxies are represented by differentially rotating discs of stars and, while they are 
at their distance of closest approach, the stars in the outermost rings feel the same 
mutual force acting upon them for a very much longer time than if the passage 
had been in, say, the retrograde direction. As a result, the outer rings of stars feel 
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Fig. 3.5. The peculiar galaxy known as the Cartwheel as observed by the Hubble Space 
Telescope. (Image courtesy of NASA, ESA and the Hubble Heritage Team (STScI/AURA)) 
Its strange appearance is almost certainly due to a recent collision, or strong interaction, with 
one of its nearby companions. The simulations by Toomre and Toomre show that such a ‘tidal 
wave’ is expected if a compact mass had passed through a spiral galaxy close to its centre 
(Toomre, 1974) 



a coherent force for an extended period and are stripped off to form the types 
of extended structure observed in the Antennae. Many of the features of strong 
gravitational interactions are described in the pioneering papers by the Toomres. 
Similar structures are found in more recent supercomputer simulations of colliding 
and interacting galaxies which can include millions of stars, as well as incorporating 
the dynamics of the interstellar gas in the collision (Barnes and Hernquist, 1996; 
Mihos and Hernquist, 1996; Springel, 2005). The compression of the interstellar 
media in the collision results in regions of intense star formation. 

Interactions between galaxies play a central role in many aspects of galactic 
evolution. From the observational point of view, the IRAS satellite showed that 
colliding galaxies are among the most luminous extragalactic far-infrared sources. 
The inference is that, when galaxies collide, their interstellar media are compressed 
to high densities and the rate of star formation is greatly enhanced, resulting in 
intense far-infrared emission. 

Collisions between galaxies have also assumed a central role in models of galaxy 
formation. In the preferred scenarios of structure formation, galaxies are built up 
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Fig. 3.6a, b. The strange pair of interacting galaxies known as the Antennae, named after 
the elongated structures apparently torn off in a collisions between the galaxies, a A wide 
field image showing the extended ‘tails’ produced in the encounter, b A ‘true colour’ Hubble 
Space Telescope image of the colliding galaxies. The many blue regions in the diagram contain 
young luminous stars, indicating that a large amount of star formation has been induced by 
the collision between the interstellar media of the interacting galaxies. (Image courtesy of 
NASA, ESA and the Hubble Heritage Team (STScI/AURA)) 



by the process of hierarchical clustering in which larger galaxies are formed by 
the coalescence of smaller galaxies. In this picture, strong gravitational encounters 
between galaxies are essential in forming the structures we observe today. Reference 
to Table 3.2 shows that the percentage of peculiar and interacting systems among the 
present population of galaxies is only about 1%. We will find that this percentage 
increases dramatically as we look further and further back in time, consistent with 
the hierarchical picture of structure formation. 



3.4 The Light Distribution in Galaxies 

Another approach to the classification of galaxies is to use their light distributions 
since it is found that these are somewhat different for bulge-dominated and disc- 
dominated systems. Let us first summarise the results of studies of bright galaxies 
and then show how these can be adapted for the study of large samples of galaxies. 
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Fig. 3.7. A simulation of a close encounter between two disc galaxies which approach each 
other on prograde orbits. It can be seen that the outer rings of stars are tom off each galaxy, 
forming the remarkable ‘Antenna’ structures (Toomre and Toornre, 1972) 
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3.4.1 Elliptical Galaxies 



The earliest expression for the observed surface brightness distribution of elliptical 
galaxies as a function of radius r is commonly referred to as Hubble ’s law 

^ + , (3.1) 

where r c is the core radius of the galaxy. This expression provides a reasonable 
description of the intensity distribution in the central regions of elliptical galaxies 
but is a poor fit in the outer regions. 

A much better description of the surface brightness distribution of elliptical 
galaxies and the bulges of spiral galaxies is the empirical law proposed by de 
Vaucouleurs which is usually referred to as the r 1/4 law (de Vaucouleurs, 1948) 
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This expression provides a good representation of the luminosity profile over many 
decades of surface brightness. The expression has been normalised so that r e is 
the radius within which half the total luminosity is emitted and I(r e ) is the surface 
brightness at that radius. The corresponding total luminosity of the galaxy is 

L = 7.2157r/ e r e 2 ^ , (3.4) 

where b/a is the apparent axis ratio of the elliptical galaxy (Gilmore et al., 1989). 

3.4.2 Spiral and Lenticular Galaxies 

The light distributions in most spiral and lenticular galaxies can be decomposed 
into two components, a spheroidal component associated with the central bulge and 
a disc component. The luminosity profile of the spheroidal component is the same 
as that of an elliptical galaxy and may be described by the de Vaucouleurs r ]/4 law 
discussed above. 

In almost all galaxies in which there is evidence of a disc component, including 
spirals, barred spirals and lenticulars, the luminosity profile of the disc may be 
represented by an exponential light distribution 



I(r) = Iq exp(— r//?) , 



(3.5) 
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where h is called the disc scale length ; for our own Galaxy h & 3 kpc. Iq is the central 
surface brightness. The total luminosity of the disc is then L = 4nh 2 Iq. According 
to Freeman, this luminosity profile is also found in very late-type galaxies, such as 
the Magellanic irregulars which show evidence of a disc in rotation. 

In 1970, Freeman discovered the remarkable result that, although the disc com- 
ponents of large disc galaxies have a wide range of luminosities, there is remarkably 
little scatter in the value of the central surface brightness Iq (Freeman, 1970). A mean 
value of Iq = 21.67 ± 0.3 B magnitudes arcsec 2 was found for the galaxies in the 
sample studied by Freeman, the differences in total luminosity being due to variations 
in the scale length of the light distribution h from galaxy to galaxy. There has been 
considerable debate about the validity of this result because the samples of galaxies 
studied are strongly influenced by selection effects, in particular, the galaxies have 
to be bright enough and large enough for precise surface photometric observations 
to be possible. Disney suggested that the constancy of the central surface brightness 
of the discs of spiral galaxies could be largely attributed to these selection effects 
(Disney, 1976). Van der Kruit surveyed a number of attempts to remove the effects 
of observational selection from the samples studied and concluded that ‘non-dwarf 
galaxies do have a relatively narrow dispersion of central surface brightnesses, and 
this is not the result of selection effects’ (Gilmore et al., 1989). It is certainly the 
case that, for low luminosity spiral galaxies, the values of central surface brightness 
are smaller Freeman’s standard value, the most extreme cases being the low surface 
brightness disc galaxies which can have Iq as low as 25.5 B magnitudes arcsec -2 . 

3.4.3 Putting the Light Distributions Together 

A convenient way of combining the light distributions of elliptical and spiral galax- 
ies is to adopt the formulation proposed by Sersic which can be thought of as 
a generalisation of de Vaucouleurs’ r ,/4 law (Sersic, 1968) 



where r e is the radius within which half of the total light is emitted and the b n is 
a normalisation constant to ensure that the total light sums to L tot for a given value 
of 77. It can be seen that the value n — 4 results in de Vaucouleurs’ 7 - 1/4 law and 
77 = 1 in the exponential law found in the discs of spiral galaxies. 

This formalism can be used to discriminate between disc-dominated and bulge- 
dominated galaxies. A beautiful example of the application of Sersic’s formula to 
a large sample of galaxies is shown in Fig. 3.8a which shows the distribution of 
77 among 10,095 galaxies selected from the Millennium Galaxy Catalogue (Driver 
et al., 2006). It can be seen that the galaxy sample splits very beautifully into two 
populations, one centred on the value n = 4, corresponding to the elliptical galaxies 
and the bulges of spiral galaxies and the other the value n = 1, corresponding to 
the light distribution of disc galaxies. In the analysis by Driver and his colleagues, 
the morphological categories were checked by visual inspection of the images of the 
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Fig. 3.8. a A plot of the observed value of the Sersic index n as a function of the absolute 
blue magnitude in a sample of 10,095 galaxies from the Millennium Galaxy Catalogue, b The 
histogram showing the number of galaxies in equal logarithmic bins of Sersic index n (Driver 
et al., 2006) 



galaxies. The distinction between the bulge-dominated and disc-dominated systems 
occurs at about n ~ 2. As we will see, these structural parameters are strongly 
correlated with other properties of the galaxies, in particular, with the red and blue 
galaxy sequences described in Sect. 3.9.2. This procedure is particularly valuable 
in distinguishing the structural properties of large samples of galaxies by computer 
analyses. 



3.5 The Masses of Galaxies 

All direct methods of measuring masses in astronomy are dynamical. For systems 
such as star clusters, galaxies and clusters of galaxies, it can generally be assumed 
that they have reached some form of dynamical equilibrium and then, by measuring 
the velocities of the objects which make up the system and knowing its dimensions, 
mass estimates can be made. A key result for determining the masses of galaxies and 
clusters of galaxies is the virial theorem, first derived for star clusters by Eddington 
in 1916 (Eddington, 1916). 

3.5.1 The Virial Theorem for Clusters of Stars, Galaxies 
and Clusters of Galaxies 

Star clusters, galaxies and clusters of galaxies can generally be considered to be 
gravitationally bound configurations, meaning that the stars or galaxies have come 
into dynamical equilibrium under gravity. This assertion is supported by comparison 
of the crossing time of an object within the system with its age. The crossing time is 
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defined to be t cr = R/( v) where R is the size of the system and (v) is the typical speed, 
or velocity dispersion, of the objects of which it is composed. For example, the orbital 
speed of the stars in our Galaxy at our distance from its centre, 8.5 kpc, is about 220 
km s _l . Therefore, the time it takes the stars to make one complete revolution about 
the centre of our Galaxy is t — 2rcR/v 2.5 x 10 8 years. This is very much less 
than the age of the Galaxy, which is about 1.3 x 10 10 years, and so the system must 
be gravitationally bound. Similarly, in the Coma cluster of galaxies, the crossing 
time is less than about one-tenth the age of the Universe, indicating that the cluster 
must be gravitationally bound, or else the galaxies would have dispersed long ago. 

The virial was introduced by Rudolph Clausius in 1870 in connection with the 
thermal energy of gases. The virial was defined to be the quantity 3, = — i(r,- ■ F,} 
where the force F, acts on the particle i located at position vector r ; . The angle 
brackets represent the time average of the force acting on the particle and Clausius 
showed that 3, is the system’s average kinetic energy (see p. 105 in (Sparke and 
Gallagher, 2000)). In the astronomical context, the theorem refers to the energy 
balance in systems in equilibrium under gravity and it is found in a variety of 
different guises. In its application to the internal properties of stars, the virial theorem 
describes the relation between the thermal energy of the gas and its gravitational 
potential energy. The theorem can be extend to include rotational energy, magnetic 
energy, the energy in the form of convective motions or turbulence, and so on. In 
stellar dynamics, in which the ‘gas’ of stars may be taken to be collisionless, the 
tensor virial theorem relates the equilibrium state to the energies associated with the 
velocity distribution of the stars at each point, which will in general be anisotropic 
(Binney and Tremaine, 1987). In this section, we will only consider the simplest 
form of virial theorem for a self-gravitating system of point masses. 

Suppose a system of particles (stars or galaxies), each of mass m,, interact with 
each other only through their mutual forces of gravitational attraction. Then, the 
acceleration of the ;th particle due to all other particles can be written vectorially 




(3.7) 



Now, take the scalar product of both sides with m, r, . 




(3.8) 



Differentiating (r, ■ r,) with respect to time 




(3.9) 



and then, taking the next derivative, 




(r,- ■ r,) = (r,- ■ r, + r, ■ r,) = (r,- • r, + if) . 



(3.10) 
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Therefore, (3.8) can be rewritten 
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Now we sum over all the particles in the system, 
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Now the double sum on the right-hand side represents the sum over all the elements 
of a square n x n matrix with all the diagonal terms zero. If we sum the elements ij 
and ji of the matrix, we find 
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Therefore, 
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(3.13) 
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where the factor \ on the right-hand side is included because the sum is still over all 
elements of the array and so the sum of each pair would be counted twice. 

Now, JT 7M, r? is twice the total kinetic energy, T, of all the particles in the 
system, that is, 
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The gravitational potential energy of the system is 
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Therefore, 
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If the system is in statistical equilibrium 
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and therefore 



T = \\U\ . (3.19) 

This is the equality known as the virial theorem in stellar dynamics. 

Notice that, at no point, have we made any assumption about the orbits or velocity 
distributions of the particles. The velocities might be random, as is often assumed to 
be the case for globular clusters or spherical elliptical galaxies, but they might also 
have highly elongated orbits about the centre of the galaxy. In the case of the discs 
of spiral galaxies, the velocity vectors are highly ordered and the mean rotational 
speed about the centre is much greater than the random velocities of the stars. In 
all these cases, the virial theorem must hold if the system is to remain in dynamical 
equilibrium. In its simplest form, the expression (3.19) tells us nothing about the 
velocity distribution of the stars or galaxies within the system. 

Despite the elegance of the theorem, its application to astronomical systems is 
not straightforward. In most cases, we can only measure directly radial velocities 
from the Doppler shifts of spectral lines and positions on the sky. In some cases, 
independent distance measures of the stars or galaxies within the system are available, 
but generally, within star clusters and clusters of galaxies, it is not possible to 
distinguish whether the objects are on the near or far side of the cluster. In some cases, 
the proper motions of the objects can be measured and then their three-dimensional 
space motions can be found. Generally, for clusters of galaxies, we need to make 
assumptions about the spatial and velocity distributions of galaxies in the cluster. 
For example, if we assume that the velocity distribution of the galaxies is isotropic, 
the same velocity dispersion is expected in the two perpendicular directions as along 
the line of sight and so (v 2 ) = 3(uJ), where i)|| is the radial velocity. If the velocity 
dispersion is independent of the masses of the stars or galaxies, we can find the total 
kinetic energy T — (1 /2) m,r? = (3/2 )M(uJ), where M is the total mass of the 
system. If the velocity dispersion varies with mass, then (u?) is a mass-weighted 
velocity dispersion. If the system is spherically symmetric, we can work out from the 
observed surface distribution of stars or galaxies a suitably weighted mean separation 
R c \, so that the gravitational potential energy can be written U = G M 2 / R c \. Thus, 
the mass of the system can be found from the virial theorem 

T — \\U\ M=3(vj)R cl /G. (3.20) 

Notice that, in general, we have to estimate some characteristic velocity, or velocity 
dispersion, and the size of the system in order to find its mass. This general result is 
widely applicable in astrophysics. 

3.5.2 The Rotation Curves of Spiral Galaxies 

In the case of spiral galaxies, masses can be estimated from their rotation curves, 
that is, the variation of the orbital, or rotational, speed u ro t(r) about the centre of 
the galaxy with distance r from its centre. Examples of the rotation curves of spiral 
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galaxies derived from optical and radio 21-cm line studies are shown Fig. 3.9b 
(Bosma, 1981). In a few galaxies, there is a well-defined maximum in the rotation 
curve and the velocity of rotation decreases monotonically with increasing distance 
from the centre. If this decrease continues to infinite distance, the total mass of 
the galaxy converges and is similar to that derived from the rotation curve in the 
central regions. In many cases, however, the rotational velocities in the outer regions 
of galaxies are remarkably constant with increasing distance from the centre. It is 
apparent from Fig. 3.9a that the flat rotation curve of our spiral neighbour M31 
extends far beyond the optical image of the galaxy. 

The significance of these flat rotation curves can be appreciated from application 
of Gauss’s theorem to Newton’s law of gravity. For simplicity, let us assume that 
the distribution of mass in the galaxy is spherically symmetric, so that we can 
write the mass within radius r as M(< r). According to Gauss’s law for gravity, 
for any spherically symmetric variation of mass with radius, we can find the radial 
acceleration at radius r by placing the mass within radius r, M{< r), at the centre of 
the galaxy. Then, equating the centripetal acceleration at radius r to the gravitational 
acceleration, we find 



GM(< r ) _ tr ot (r) 



M(< r ) = 



vjt (TV 
G 



(3.21) 



For a point mass, say the Sun, M(< r ) = M 0 , and we recover Kepler’s third law 
of planetary motion, the orbital period T being equal to 2nr/v ml oc r 3/2 . This result 
can also be written i> ro t oc r l,/2 and is the variation of the circular rotational velocity 
expected in the outer regions of a galaxy if most of the mass is concentrated within 
the central regions. 

If the rotation curve of the spiral galaxy is flat, u ro , = constant, M{< r) oc r and 
so the mass within radius r increases linearly with distance from the centre. This 
contrasts dramatically with the distribution of light in the discs, bulges and haloes of 
spiral galaxies which decrease exponentially with increasing distance from the centre 




Fig. 3.9. a The rotation curve for the nearby giant spiral galaxy M3 1 , showing the flat rotation 
curve extending well beyond the optical image of the galaxy (Courtesy of Dr. Vera Rubin) 
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Fig. 3.9. (continued) b Examples of the rotation curves of spiral galaxies from optical and 
neutral hydrogen observations (Bosma, 1981) 

(see Sect. 3.4.2). Consequently, the local mass-to-luminosity ratio must increase in 
the outer regions of spiral galaxies. 

It is most convenient to quote the results in terms of mass-to-luminosity ratios 
relative to that of the Sun. For the visible parts of spiral galaxies, for which the rotation 
curves are well-determined, mean mass-to-light ratios in the B waveband in the 
range 1-10 are found. This is similar to the value found in the solar neighbourhood; 
averaging over the masses and luminosities of the local stellar populations, a value 
of M/L ~ 3 is found. The M/L ratio must however increase to much larger values at 
large values of r. Values of M/L 10-20 M Q /L Q are found in the outer regions of 
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spiral galaxies, similar to the values found for elliptical galaxies. These data provide 
crucial evidence for the presence of dark matter in galaxies. 

There are theoretical reasons why spiral galaxies should possess dark haloes. 
Ostriker and Peebles showed that, without such a halo, a differentially rotating disc 
of stars is subject to a bar instability (Ostriker and Peebles, 1973). Their argument has 
been confirmed by subsequent computer simulations and suggests that dark haloes 
can stabilise the discs of spiral galaxies. We will return to the thorny question of the 
nature of the dark matter in Chap. 4. 

3.5.3 The Velocity Dispersions of Elliptical Galaxies 

Expression (3.20) can be used to estimate the masses of elliptical galaxies. Doppler 
broadening of the widths of stellar absorption lines in galaxies can be used to estimate 
the velocity dispersion ( Awjj) of stars along the line of sight through the galaxy. Typ- 
ical mass-to-luminosity ratios for elliptical galaxies are about 10-20 Mq/Lq. The 
trouble with this argument is that it has to be assumed that the velocity distribution of 
the stars in the elliptical galaxy is isotropic. As will be discussed in Sect. 3.6.3, there is 
compelling evidence that in general elliptical galaxies are triaxial systems and so the 
isotropy of the stellar velocity distribution needs to be tested directly by observation. 

Evidence that there must indeed be considerable amounts of dark matter in 
the haloes about two of the giant elliptical galaxies in the Virgo cluster, M49 and 
M87, has been presented by Cote and his colleagues (Cote et ah, 2001, 2003). They 
measured the radial velocities of a large sample of globular clusters in the haloes of 
these galaxies and so were able to extend the range of radii over which the velocity 
dispersion in these galaxies could be measured. A beautiful example of the quality of 
their data for M49 is shown in Fig. 3.10. Their measurements are shown by the filled 
circles at radii R > lOkpc, the dotted and solid lines bracketing them showing the 
one and two sigma ranges of their estimates of the velocity dispersion. The points 
at radii less than lOkpc show the velocity dispersion measured by other authors and 
it can be seen that the data are consistent with the velocity dispersion remaining 
remarkably constant out to radii up to 40 kpc from the centre. 

Various attempts to account for the variation of the velocity dispersion with 
radius are indicated by the different lines on the diagram. These assume that the mass 
distribution follows the radial optical intensity distribution, but with various extreme 
assumptions about the anisotropy of the stellar velocity distribution. Even models 
in which the stars (or globular clusters) are on radial orbits cannot account for the 
fact that the line-of-sight velocity dispersion is independent of radius out to 40 kpc. 
Cote and his colleagues conclude that these data provide evidence that the velocity 
dispersion is isotropic and that there must be dark matter haloes about these galaxies. 
Similar conclusions can be drawn from X-ray observations of these galaxies, using 
the technique described in the context of clusters of galaxies in Sect. 4.4. 

Physically, the fact that the velocity dispersion remains constant out to large 
radii has exactly the same explanation as the flatness of the rotation curves of spiral 
galaxies (see expression 3.21). To bind globular clusters to these massive galaxies 
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Fig. 3.10. The velocity dispersion of stars and globular clusters in the nearby giant elliptical 
galaxy M49 (NGC 4472). The data points at R < 10 kpc are obtained from the velocity 
width of the stellar absorption lines. The filled circles at radii R > 10 kpc are derived from 
the velocity dispersion of globular clusters. Various models for the velocity dispersion as a 
function of radius R, assuming that the mass follows the light are shown (Cote et al., 2003) 



with these large velocity dispersions at large radii means that the mass within radius 
R must increase proportional to R. 



3.6 The Properties of Spiral and Elliptical Galaxies 

At first glance, it would seem that the elliptical galaxies should be among the simpler 
stellar systems to interpret theoretically because they can be approximated as single 
spheroidal stellar distributions. This turns out to be an overoptimistic expectation. 

3.6.1 The Faber-Jackson Relation and the Fundamental Plane 

Extensive studies have been made of correlations between various properties of 
elliptical galaxies, specifically, their luminosities, their sizes, as described by the de 
Vaucouleurs radius r e , their central velocity dispersions, their surface brightnesses, 
the abundance of heavy elements, and so on. Of these, two studies are of particular 
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importance. The first is the analysis of Faber and Jackson who found a strong 
correlation between luminosity L and central velocity dispersion a of the form 
L oc cr x where x ~ 4 (Faber and Jackson, 1976). This correlation has been studied by 
other authors who have found values of x ranging from about 3 to 5. The significance 
of this relation is that, if the velocity dispersion a is measured for an elliptical galaxy, 
its intrinsic luminosity can be found from the Faber-Jackson relation and hence, by 
measuring its observed flux density, its distance can be found. 

This procedure for measuring distances was refined by Dressier and his col- 
leagues and by Djorgovski and Davis who introduced the concept of the fundamental 
plane for elliptical galaxies (Dressier et al., 1987; Djorgovski and Davis, 1987). The 
fundamental plane lies in a three-dimensional space in which luminosity L is plotted 
against the central velocity dispersion a and the mean surface brightness Z e within 
the half-light radius r e , that is, Z e — L(< rf/rtr^. Dressier and his colleagues 
found an even stronger correlation than the Faber-Jackson relation when the surface 
brightness was included. 



L oc er 8/3 . (3.22) 

Various expressions for the fundamental plane appear in the literature, for example 

r e oc a 1 ' 4 /” 0 ' 9 (3.23) 

which is remarkably similar to (3.22). 

Dressier and his colleagues found just as good a correlation if they introduced 
a new diameter D n , which was defined as the circular diameter within which the 
total mean surface brightness of the galaxy exceeded a particular value. The surface 
brightness was chosen to be 20.75 B magnitudes arcsec 2 . The correlation found 
was a oc D,,' 4 , thus incorporating the dependence of both L and E e into the new 
variable D n . 

The origin of these empirical correlations is not understood. The argument can 
be inverted to determine under what conditions relations such as the Faber-Jackson 
relation would be found. For example, since the mass of the galaxy is given by 
M oc a 2 /- e and L oc 7 e r 2 , it follows that L oc a 4 / 7 e (M / L) 2 . Thus, if 7 e and M/L 
were constant for all elliptical galaxies, we would obtain L oc a 4 . It is not at all 
clear, however, why 7 e and M/L should be constant for elliptical galaxies. 

Despite the lack of theoretical underpinning of these correlations. Dressier and 
his colleagues estimate that they enable the distances of individual galaxies to be 
determined to about 25% and for clusters of galaxies to about 10%. 

3.6.2 Ellipticals Galaxies as Triaxial Systems 

It might be thought that the internal dynamics of elliptical galaxies would be relatively 
straightforward. Their surface brightness distributions appear to be ellipsoidal, the 
ratio of the major to minor axes ranging from 1 : 1 to about 3: 1 . It is natural to attribute 
the flattening of the elliptical galaxies to the rotation of these stellar systems and this 



72 



3 Galaxies 



can be tested by observations of the mean velocities and velocity dispersions of the 
stars throughout the body of the galaxy. These measurements can be compared with 
the rotation and internal velocity dispersions expected if the flattening of the elliptical 
galaxies were wholly attributed to the rotation of an axisymmetric distribution of 
stars. In the simplest picture, it is assumed that the velocity distribution is isotropic 
at each point within the galaxy. 

Bertola and Capaccioli in 1975 and Illingworth in 1977 first showed that elliptical 
galaxies rotate too slowly for centrifugal forces to be the cause of their observed 
flattening; in other words, the ratio of rotational to random kinetic energy is too 
small (Bertola and Capaccioli, 1975; Illingworth, 1977). This analysis was repeated 
in 1983 for a larger sample of elliptical galaxies and for the bulges of spiral galaxies 
by Davies and his colleagues with the results shown in Fig. 3.11 (Davies et al., 
1983). The solid lines show the amount of rotation v m necessary to account for the 
observed ellipticity of the elliptical galaxy relative to the velocity dispersion a of the 
stars. It can be seen that, for low luminosity elliptical galaxies and for the bulges of 
spiral galaxies, the ellipticity of the stellar distribution can be attributed to rotation. 
The most luminous ellipticals with Mg < —20.5 generally do not possess enough 
rotation to account for the observed flattening of the galaxies. This means that the 
assumptions of an axisymmetric spatial distribution and/or an isotropic velocity 
distribution of stars at all points within the galaxy must be wrong. As a consequence, 
these massive elliptical galaxies must be triaxial systems, that is, systems with three 
unequal axes and consequently with anisotropic stellar velocity distributions. There 
is no reason why the velocity distribution should be isotropic because the time- 
scale for the exchange of energy between stars through gravitational encounters is 
generally greater than the age of the galaxy. Therefore, if the velocity distribution 
began by being anisotropic, it would not have been isotropised by now. 

Further evidence for the triaxial nature of massive elliptical galaxies has come 
from studies of their light distributions. In many systems not only does the ellipticity 
of the isophotes of the surface brightness distribution vary with radius, but also the 
position angle of the major axis of the isophotes can change as well. All types of 
variation of ellipticity with radius are known. In some cases there is a monotonic 
change with radius but in others there can be maxima and minima in the radial 
variation of the ellipticity (Bertola and Galletta, 1979). The dynamics of such galaxies 
must be much more complicated than those of a rotating isothermal gas sphere. 
Another piece of evidence for the complexity of the shapes and velocity distributions 
within elliptical galaxies comes from the observation that, in some ellipticals, rotation 
takes place along the minor as well as along the major axis (Bertola et al., 1991). Thus, 
despite their simple appearances, some elliptical galaxies may be triaxial systems. 

The theoretical position has been clarified by an elegant and original analysis by 
Martin Schwarzschild (Schwarzschild, 1979). By applying linear programming tech- 
niques to the determination of orbits in general self-gravitating systems, he showed 
that there exist stable triaxial configurations not dissimilar from those necessary to 
explain some of the internal dynamical properties of what appear on the surface 
to be simple ellipsoidal stellar distributions. His analysis showed that there exist 
stable orbits about the major and minor axes but not about the immediate axis of the 



3.6 The Properties of Spiral and Elliptical Galaxies 



73 




Fig. 3.11. A diagram showing the flattening of elliptical galaxies as a function of their rota- 
tional velocities. The open circles are luminous elliptical galaxies, the filled circles are lower 
luminosity ellipticals and the crosses are the bulges of spiral galaxies. If the ellipticity were 
entirely due to rotation with an isotropic stellar velocity distribution at each point, the galaxies 
would be expected to lie along the solid line. This diagram shows that, at least for massive 
ellipticals, this simple picture of rotational flattening cannot be correct (Davies et al., 1983) 



triaxial figure. With this new understanding of the stellar motions in elliptical galax- 
ies, galaxies can be characterised as oblate-axisymmetric, prolate-axisymmetric, 
oblate-triaxial, prolate-triaxial and so on. 

3.6.3 The Tully-Fisher Relation for Spiral Galaxies 

The masses of spiral galaxies can be estimated from their rotation curves as described 
inSect. 3.5.2. In 1975, Tully and Fisher discovered that, for spiral galaxies, the widths 
of the profiles of the 21 -cm line of neutral hydrogen, once corrected for the effects of 
inclination, are strongly correlated with their intrinsic luminosities (Tully and Fisher, 
1977). In their studies, they correlated the total B luminosities with the corrected 
velocity width A V of the 2 1 -cm line and found the relation 



L b oc AT" , 



(3.24) 
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where a — 2.5. A much larger survey carried out by Aaronson and Mould found 
a somewhat steeper slope, a — 3.5, for luminosities measured in the optical B 
waveband and an even steeper slope, a — 4.3 in the near-infrared H waveband 
at 1.65 pm (Aaronson and Mould, 1983). The correlation was found to be much 
tighter in the infrared as compared with the blue waveband, because the luminosities 
of spiral galaxies in the blue waveband are significantly influenced by interstellar 
extinction within the galaxies themselves, whereas, in the infrared waveband the dust 
becomes transparent. What has come to be called the infrared Tully-Fisher relation 
is very tight indeed. As a result, measurement of the 21 -cm velocity width of a spiral 
galaxy can be used to infer its absolute H magnitude and hence, by measuring its 
flux density in the H waveband, its distance can be estimated. This procedure has 
resulted in some of the best distance estimates for spiral galaxies and has been used 
in programmes to measure the value of Hubble’s constant. 

There is an interesting interpretation of the Tully-Fisher relation for exponential 
discs. Suppose that the mass distribution follows the same distribution as the optical 
surface brightness with radius, I — Iq exp(— r/h). Then, the total mass of the disc is 

pOO nOO 

M= I 2TcrIoe~ r / h dr = 2 ttIoIi 2 I xe~ x dx — 2nIoh 2 . (3.25) 

Jo Jo 

Thus, most of the mass of the disc lies within radius r ~ h. The maximum of the 
rotation curve therefore corresponds roughly to the Keplerian velocity at distance 
h from the centre. Placing all the mass at the centre of the disc and equating the 
centripetal and gravitational accelerations, the maximum of the rotation curve is 
expected to correspond to Vjn ax where 



^max ^ 27tGI 0 h 2 ' 
h h 2 



F m ax OC (/ 0 /i) 1/2 • 



(3.26) 



Eliminating h from (3.25) and (3.26), we find that M oc V^ ax . If we now adopt 
Freeman’s result that the central surface brightnesses of bright spiral galaxies have 
a roughly constant value and assume that the mass-to-luminosity ratio is constant 
within the discs of spiral galaxies, we expect L oc V^ ax , roughly the observed 
Tully-Fisher relation. 



3.6.4 Luminosity-Metallicity Relations 

An important aspect of the physics of galaxies which will play an important role 
in understanding their formation is the relation between their luminosities, masses, 
colours and the abundances of the heavy elements, the last being referred to as their 
metallicities. The observations are well-understood, but their interpretation is subject 
to many caveats and uncertainties. 

For elliptical galaxies, Faber showed in 1973 showed that there is a correlation 
between their luminosities and the strength of the magnesium absorption lines (Faber, 
1973). In subsequent analyses, a similar relation was established over a wide range 
of luminosities and between the central velocity dispersion of the elliptical galaxy 
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and the strength of the Mg 2 index (Bender et al., 1993). They also showed that the 
Mg 2 index was strongly correlated with the (B - V) colours of the bulges of these 
galaxies and so the correlation referred to the properties of the galaxy as a whole. The 
result was a strong correlation between luminosity, colour and metallicity spanning 
almost a factor of 1000 in luminosity (Fig. 3.12). 

A similar relation was found by Visvanathan and Sandage for elliptical galaxies 
in groups and clusters of galaxies in the sense that the more luminous the galaxy, 
the redder they were observed to be (Visvanathan and Sandage, 1977). Their prime 
interest was in using this correlation in groups and clusters of galaxies to estimate 
their distances, but the sense of the correlation is the same as that found by Faber and 
her colleagues since galaxies with greater metallicities have greater line blanketing 
in the blue and ultraviolet regions of the spectrum and hence are redder than their 
lower metallicity counterparts. 

A similar correlation was first established for late-type and star forming galaxies 
by Lequeux and his colleagues (Lequeux et al., 1979). These pioneering studies 
involved determining the gas-phase metallicities of the galaxies and were followed 
by a number of studies which extended the luminosity-metallicity correlation to 
a range of 1 1 magnitudes in absolute luminosity and a factor of 100 in metallicity 
(Zaritsky et al., 1994). These studies laid the foundation for the analyses of the huge 
databases of galaxies available from the Sloan Digital Sky Survey. 




Fig. 3.12. The correlation between the luminosity of the bulges or spheroids of elliptical 
galaxies, what Faber and her colleagues refer to as ‘Dynamically Hot Galaxies’, and the 
strength of their absorption lines due to heavy elements, as measured by the Mg 2 index 
(Bender et al., 1993) 
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In the analysis of Tremonti and her colleagues, rather than using luminosity, 
they work directly with the stellar mass of the galaxy (Tremonti et al., 2004). This 
approach has become feasible thanks to the development of efficient and reliable 
codes for determining the stellar and gaseous masses of galaxies from their optical 
spectra (Bruzual and Chariot, 2003; Chariot and Longhetti, 2001). It turns out that 
the correlation with stellar mass is stronger than that with luminosity. Figure 3.13 
shows the strong correlation between metallicity and the total stellar mass of star- 
forming galaxies. These observations provide important constraints on the physics 
of the evolution of galaxies. With the advent of 8-10-metre class telescopes, these 
studies have been extended to samples of galaxies at large redshifts and so constrain 
directly the evolution of the stellar and gaseous content of galaxies of different 
masses (Savaglio et al., 2005). These topics will be taken up in much more detail in 
Chaps. 17 to 19. 




Fig. 3.13. The stellar mass-gas phase metallicity relation for 53,400 star-forming galaxies 
from the SDSS. The large black points represent the median in bins of 0.1 dex in mass which 
include at least 100 data points. The thin line through the data is a best-fitting smooth curve 
and the solid lines are the contours which enclose 68% and 95% of the data (Tremonti et al., 
2004) 
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3.7 The Luminosity Function of Galaxies 

The frequency with which galaxies of different intrinsic luminosities are found in 
space is described by the luminosity function of galaxies. The luminosity function 
of galaxies <p(L ) dL is defined to be the space density of galaxies with intrinsic 
luminosities in the range L to L + dL. If S is the flux density (in W m 1 Hz -1 ) 
of a nearby galaxy, for which redshift corrections can be neglected, the luminosity 
of the galaxy is L = 4jtr 2 S (in W Hz 1 ), where r is the distance of the galaxy. 
In optical astronomy, it is traditional to work in terms of absolute magnitudes, M, 
rather than luminosities and so, in terms of absolute magnitudes, the luminosity 
function <p(L) d L = @(M) AM. The important difference between these two forms 
of the luminosity function is that, in terms of magnitudes, the luminosity function is 
presented on a logarithmic scale of luminosity. The absolute magnitude M and the 
luminosity L are related by the expression 

lo g(~)=-0.4 (M-M*). (3.27) 

where the absolute magnitude M* and the luminosity L* are corresponding reference 
values of these quantities. 

In 1977, Felten made a careful comparison of nine different determinations of the 
local luminosity function for nearby galaxies, reducing them all to the same value of 
Hubble’s constant, the same magnitude system and the same corrections for Galactic 
extinction. In this heroic analysis, he found that the independent determinations were 
in remarkably good agreement (Felten, 1977). Felten’s analysis is summarised in 
Fig. 3.14, using reduced absolute magnitudes, M B o in de Vaucouleurs’ By magnitude 
system and using a Galactic extinction law Ab = 0.25 cosec \b\. The solid line shows 
a best-fit to the data of the form of luminosity function proposed by Schechter 

</>(x) Ax = <p*x a er x dx , (3.28) 

or, 

4>(L)dL = <p* (A) exp(-A) (3.29) 

where x — L/L* and L* is the luminosity which characterises the ‘break’ in the 
luminosity function seen in Fig. 3.14 (Schechter, 1976). The form of the Schechter 
luminosity function is as simple as it could be: a power law with a high luminosity 
exponential cut-off. Its shape is characterised by two parameters, the slope of the 
power law ct at low luminosities and the ‘break’ luminosity L*. 

It is traditional in optical astronomy to write the luminosity function in terms of 
astronomical magnitudes rather than luminosities and then the beautiful simplicity 
of the Schechter function is somewhat spoiled: 

<P(M) AM = \(j>* In 10 {dex[0.4(M* - M)]}“ +1 
x exp {— dex[0.4(M* — Af)]} AM , 



(3.30) 
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M b(0) +5 log l0 (50/H1+ ( A„ -0.25 esc |b| ) 



Fig. 3.14. The local luminosity function of galaxies from nine independent estimates con- 
sidered by Felten fitted by a Schechter luminosity function of the form n(x) dx oc x a e~ x dx, 
where x = L/L* (Felten, 1977) 



where M* is the absolute magnitude corresponding to the luminosity L*. We have 
used the notation dex y to mean HP. In his reassessment of the luminosity function 
for galaxies in 1985, Felten preferred the following best-fit values: a = —1.25 and 
M B o = —20.05 + 51og 10 /z (Felten, 1985). 

The normalisation factor <p* determines the space density of galaxies and al- 
lowance has to be made for the fact that the galaxies used in the determination 
mostly lie within the local supercluster. Hence, the value of <p * is an overestimate as 
compared with what would be found for a sample of field galaxies. Felten’s preferred 
value of <p* for the general field was 1.20 x HR 2 /? 3 Mpc 3 . 

These pioneering efforts by Felten were followed by careful studies of larger 
and larger samples of galaxies (see, for example, the review by Binggeli, Sandage 
and Tammann (Binggeli et al., 1988)), culminating in the analyses of very large 
samples of galaxies observed in the 2dF and Sloan Digital Sky Survey (SDSS) galaxy 
surveys . These very large surveys sample such large volumes of the Universe that 
the problems of correcting for the presence of the local supercluster are not relevant. 
Recent determinations of the luminosity function of galaxies from these surveys 
are shown in Fig. 3.15. The 2dF galaxy survey included 221,414 galaxies for all of 
which spectroscopic redshifts and colours were available (Fig. 3.15a). The overall 
luminosity function, as well as the functions for red and blue galaxies are shown on 
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Fig. 3.15. a The luminosity function of galaxies derived from a sample of 221,414 galaxies ob- 
served in the 2dF galaxy survey. The overall luminosity function and those of the red and blue 
galaxies in the sample have been fitted by Schechter luminosity functions (Cole et al., 2005) 



Table 3.4. Parameters describing the overall luminosity function of galaxies from the 2dF and 
SDSS surveys. The functions are determined at a redshift of 0. 1 and include K-corrections 
and evolutionary corrections for the observed change in form of the luminosity functions over 



the redshift interval 0 


< z < 0.3 








Galaxy survey 


Waveband 


cj>*/h 3 Mpc" 3 


M* - 5 log jo h 


a 


2dF galaxy survey 


bj 


0.0156 


-19.52 


-1.18 


SDSS galaxy survey 


r 


0.0149 ±0.0004 


-20.44 ±0.01 


-1.05 ±0.01 



the diagram. In the case of the SDSS survey, redshifts were determined for 147,986 
galaxies (Fig. 3.15b) (Blanton et al., 2003). The best-fit parameters describing the 
overall luminosity function for these two large surveys are listed in Table 3.4. It can 
be seen that the form of these functions are in good agreement. 

3.7.1 Aspects of the Luminosity Function of Galaxies 

A number of features of the luminosity function of galaxies should be noted. 

Dependence upon galactic environment. With the availability of large unbiased 
samples of galaxies, it is possible to determine the luminosity function for galaxies 
of different morphological types in different environments, such as clusters, groups 
and void regions. The evidence of Fig. 3. 15a shows that there is a clear difference in 
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Fig. 3.15. (continued) b The luminosity function of galaxies derived from a sample of 
147,986 galaxies observed in the Sloan Digital Sky Survey. The magnitudes are measured in 
the r* waveband as observed at a redshift z = 0.1. The observations are very well-fitted by 
a Schechter luminosity function with the parameters given on the diagram and in Table 3.4 
(Blanton et al., 2003) 



the luminosity functions of red and blue selected galaxies, corresponding to early and 
late-type galaxies respectively in the Hubble sequence. In addition, there is evidence 
that the luminosity function of galaxies in rich clusters differs from that of galaxies 
in underdense regions of the Universe, the void regions. Using data from the 2dF 
galaxy survey, Croton and his colleagues showed clear differences in the forms of 
the luminosity functions for early and late-type galaxies as a function of the over 
or underdensity of the region relative to the mean density of galaxies (Fig. 3.16) 
(Croton et al., 2005). The population in the voids is dominated by late-type galax- 
ies and shows, relative to the mean, a deficit of early-type galaxies that becomes 
increasingly pronounced at magnitudes fainter than Mb, — 51og 10 /7 = —18.5. In 
contrast, clusters show a relative excess of very bright early-type galaxies with 
M bj - 51og 10 h < -19. 

These facts combined with the differences in the relative numbers of galaxies 
of different morphological types as a function of galaxy density indicate that the 
approximation of a universal luminosity function for all galaxies wherever they are 
found in the Universe is, at best, a rough approximation. 
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Fig. 3.16. The luminosity functions for early and late-type galaxies in rich clusters of galax- 
ies and in large voids (Croton et al., 2005). As they express this result, the void popula- 
tion is composed almost exclusively of faint late-type galaxies, while in the clusters the 
galaxy population brighter than M bj — 51og 10 /t = — 19 consists predominantly of early 
types 

Is L* a standard candle? In 1962, Abell suggested that the luminosity of the break 
in the luminosity function of rich clusters L*, corresponding to M*, could be used 
as a ‘standard candle’ in the redshift-apparent magnitude relation (Abell, 1962). He 
found excellent agreement with the expected slope of the redshift-magnitude relation 
using this technique. Subsequent studies of the luminosity functions of individual 
clusters of galaxies have show that they are similar in form to the standard Schechter 
function with more or less the same parameters as those described above. Schechter 
found that, if only those clusters for which good fits to his proposed function were 
included, the dispersion in the absolute magnitude of M* was only 0.25 magnitudes, 
as good a result as has been obtained from studies of the brightest galaxies in clusters 
(Schechter, 1976). 

With the availability of the large surveys of galaxies, this proposal has to be 
treated with some caution since there is evidence for the evolution of the form the 
luminosity function, even over remarkably small redshift intervals. As shown in 
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Fig. 3.17. The evolution of the luminosity function of galaxies at small redshifts. Over the 
redshift interval 0 to 0.3, the value of M* becomes more positive by (0.8 ± 0.1) magnitudes, 
that is, the break in the luminosity function moves to fainter intrinsic luminosities as the 
Universe grows older (Baldry et al., 2005) 



Fig. 3.17, the luminosity function as observed in the n -waveband shows significant 
changes over the small redshift interval 0.3 > z > 0.02 (Baldry et al., 2005). 
Specifically, they find that M* decreases by 0.8 ± 0.1 magnitudes between redshifts 
0.3 and zero. 

The brightest galaxies in clusters. At the very highest luminosities, the brightest 
galaxies in clusters do not fit smoothly onto an extrapolation of the Schechter 
luminosity function. These massive galaxies, a splendid example of which can be 
seen in Fig. 4.1, are known as cD galaxies, their characteristic being that they are 
similar to giant elliptical galaxies but in addition possess extensive stellar envelopes. 
They are the most luminous galaxies found in rich clusters and groups of galaxies. It 
appears that these are very special galaxies and not just the most luminous members 
of the luminosity function of galaxies, as was shown statistically by Tremaine and 
Richstone (Tremaine and Richstone, 1977). Evidently, there must be some physical 
reason why the first ranked cluster galaxies have these unique properties; we will 
return to this issue in Chap. 4. 

The luminosity function for low luminosity galaxies. The luminosity function is 
quite poorly known at low luminosities, because these galaxies can only be observed 
in relatively nearby groups and clusters. According to Binggeli and his colleagues, 
the lowest luminosity regions of the luminosity function are exclusively associated 
with irregular and dwarf elliptical galaxies (Binggeli et al., 1988). These conclusions 
are confirmed by analyses of the SDSS, in particular, the analysis of a large sample 
of galaxies with distances in the range 10 < r < 150 Mpc so that galaxies as faint 
as M = — 12.5 can be included (Blanton et al., 2005). These data show an upturn in 
the slope of the luminosity function at very low luminosities, the best-fitting value of 
a being about —1.3. As the authors comment, however, a large number of galaxies 
at very low luminosities may be missing because of their low surface brightnesses 
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and so the true low luminosity slope may be —1.5 or even steeper. In agreement 
with Bingelli and his colleagues, they find that extremely low luminosity galaxies 
are predominantly blue, low surface brightness, exponential disks. 

3.7.2 The Integrated Luminosity and the Mean Mass-to-Luminosity Ratio 
for Visible Matter in the Universe 

An important calculation is the integrated luminosity of all the galaxies within 
a given volume of space. For a cluster of galaxies, the result would be the integrated 
optical luminosity of the cluster; if this were a typical unit volume of space, the 
result would be the luminosity density of the radiation due to all the galaxies in the 
Universe. Although the number of galaxies in the luminosity function diverges at 
low luminosities, the total background light remains finite. The luminosity density 
is 



p OO pOO 

s B( o) = / L<p(L)dL =(p*L* / x a+l eT x d* 

Jo Jo 

= (P*L*r(a + 2), (3.31) 

where F is the gamma-function. For a cluster of galaxies, <p* is the normalisation 
factor in the luminosity function. To estimate the luminosity density of a typical 
volume of space, we can use the values determined by Felten for the field luminosity 
function quoted above, a = —1.25,0* = 1.2 x 10 3 /z 3 Mpc 3 and M* = —20.05 + 
51og 10 /r, corresponding to 1.24 x 10 10 h~ 2 Lq. Then, 

e B(0) = 1.8 x 10 8 /rL Q Mpc -3 . (3.32) 

The value found from the SDSS luminosity function (Blanton et al., 2003) in the 01 r 
waveband is 



(1.84 ± 0.04) x 10 8 /zL q Mpc' 3 . 



(3.33) 



These results are consistent with other estimates of the luminosity density, for 
example from the 2dF Galaxy Redshift Survey (Fig. 3.15a) and the Millennium 
Galaxy Catalogue. 

A useful reference value for cosmological studies is the average mass-to- 
luminosity ratio for the Universe, if it is assumed to have the critical cosmological 
density, q c = 3//q/8jtG = 2.0 x 10 -26 /z 2 kg m 3 . In terms of solar units, the 
mass-to-luminosity ratio would be 



6c 

£b 




= 1600/r 




B 



(3.34) 



Although there is some variation about this estimate, its importance lies in the fact 
that it is significantly greater than the typical mass-to-luminosity ratios of galaxies 
and clusters of galaxies, even when account is taken of the dark matter which must 
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be present. This result indicates that the mass present in galaxies and clusters of 
galaxies is not sufficient to close the Universe. 

It is useful to work out typical values for the mean space density and luminosity 
of galaxies. Using the mean luminosity of galaxies for Felten’s best estimate of the 
luminosity function with a — —1.25, we find (L) — 1.25 L* — 1.55 x 10 w h~ 2 Lq. 
Adopting the mean luminosity density of the Universe given by (3.32), the typical 
number density of galaxies h = £b( 0 )/(L) — 10 2 /z 3 Mpc 3 . In other words, the 
typical galaxies which contribute most of the integrated light of galaxies are separated 
by a distance of about 5 /z _1 Mpc, if they were uniformly distributed in space, which 
we know to be very far from the truth. For reference, galaxies such as our own 
and M31 have luminosities L (ia |(B) ~ 10 10 Lq. Evidently, if the ratio of mass-to- 
luminosity were the same for all galaxies, the ‘mean’ galaxies would also contribute 
most of the visible mass in the Universe. 

These data also enable limits to be placed upon the average mass density in 
stars at the present epoch. In the simplest estimate, we can adopt a typical mass-to- 
luminosity ratio for the visible parts of galaxies of M/L ~ 3 and then the density 
parameter in stars at the present epoch would be T2Ji = 2 x 1 0 3 . A very much 
more careful analysis has been carried out by Bell and his collaborators who used the 
combined SDSS and Two Micron All Sky Survey (2MASS) catalogues of galaxies 
(Bell et al., 2003). The benefit of including the 2MASS data is that the luminosity 
functions can be determined in the relatively unobscured 2 pm waveband. Their 
upper limit to the stellar mass density in the local Universe is 

QJi = (2 ± 0.6) x 10~ 3 , (3.35) 

assuming the initial mass function of stars is as rich in low mass stars as is allowed 
by galaxy dynamics in the local Universe. This is a key result for many aspects of 
galaxy formation. 



3.8 The Properties of Galaxies: 

Correlations Along the Hubble Sequence 

What gives the Hubble classification physical significance is the fact that a number 
of physical properties are correlated with position along the sequence. Many of these 
were reviewed by Roberts and Haynes in an important analysis of the properties of 
a large sample of bright galaxies selected primarily from the Third Reference Cata- 
logue of Bright Galaxies (de Vaucouleurs et al., 1991; Roberts and Haynes, 1994). 
They emphasised that, although there are clear trends, there is a wide dispersion 
about these correlations at any point along the sequence (Fig. 3.18). 

Some of the more important findings of Roberts and Haynes’ survey are as 
follows: 

- Total masses and luminosities. The average masses and range of masses are 
roughly constant for galaxies in classes SO to Scd. At later stages beyond Scd, 
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Fig. 3.18a-g. Global galaxy parameters as a function of stage along the Hubble sequence 
(Roberts and Haynes, 1994). The circles represent the galaxies in the RC3-UGC sample and the 
squares those within the local supercluster of galaxies. The filled circles are medians; the open 
symbols are mean values. The error bars represent the 25 and 75 percentiles of the distributions, 
a Total masses, b Total mass-to-luminosity ratio (Mt/L%)\ c Neutral hydrogen mass to 
total mass (Mhi/Mt); d Neutral hydrogen mass to blue luminosity (Mhi/Tb) 
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Fig. 3.18. (continued) e Total mass surface density (crx); f Surface mass density of neutral 
hydrogen (am); g Integrated (B — V) colour 



the masses of the galaxies decrease monotonically (Fig. 3.18a). The mass-to- 
luminosity ratios of the galaxies in the sample are roughly constant (Fig. 3.18b) 
and so it is no surprise that the average luminosity for the SO to Scd galaxies is 
roughly constant, whilst it decreases monotonically beyond Scd. These relations 
again quantify van den Bergh’s remark that the classical Hubble types refer 
primarily to luminous, and consequently, massive galaxies. 

- Neutral hydrogen. There is a clear distinction between elliptical and spiral galax- 
ies in that very rarely is neutral hydrogen observed in ellipticals while all spiral 
and late-type galaxies have significant gaseous masses. The upper limit to the 
mass of neutral hydrogen in elliptical galaxies corresponds to < 10 4 . 

For spiral galaxies, the fractional mass of the galaxy in the form of neutral hy- 
drogen ranges from about 0.01 for Sa galaxies to about 0.15 at Sm, the increase 
being monotonic along the revised Hubble sequence (Fig. 3.18c). The fractional 
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hydrogen mass is more or less independent of the mass of the galaxy at a par- 
ticular point along the Hubble sequence. A consequence of the constancy of the 
A^tot/Z-B ratio for the galaxies in the sample is that there is also a significant trend 
for the ratio Mhi/Lb to increase along the sequence (Fig. 3. 18d). 

- Total surface density and sutface density of neutral hydrogen. These quantities 
change in opposite senses along the Hubble sequence. The total surface density, 
as determined by the total mass of the galaxy and its characteristic radius, 
decreases monotonically along the sequence (Fig. 3.18e), whereas the surface 
density of neutral hydrogen increases along the sequence (Fig. 3.18f). 

- Integrated colour. There is a strong correlation in the sense that elliptical galaxies 
are red whereas late-type galaxies are blue. This relation is shown quantitatively 
in Fig. 3. 1 8g. Despite the systematic trend, there is a significant dispersion about 
the relation at each point in the sequence. For example, there are Sc galaxies 
which are red. As we will see, the analysis of the very large samples of galaxies 
provided by the SDSS and 2dF Galaxy Surveys have quantified the central 
importance of colour in understanding the astrophysics of galaxies (Sect. 3.9). 

- Luminosity function of H1I regions. In a pioneering study, Kennicutt and his 
colleagues determined the luminosity function of HII regions in different galaxy 
types (Kennicutt et al., 1989). Normalising to the same fiducial mass, it was 
found that there is a much greater frequency of HII regions in the late-type 
galaxies as compared with early-type galaxies and that the relation is monotonic 
along the sequence. 

Morton and Haynes pointed out that an obvious interpretation of these correlations 
is that there are different rates of star formation in different types of galaxy. As 
they express it, the various correlations provide information about the past, current 
and future star formation rates in galaxies. The correlation with colour along the 
sequence is related to the past star formation history of the galaxy; the changes in the 
luminosity function of HII regions refer to star formation rates at the present epoch; 
the large fraction of the mass of neutral hydrogen and its large surface density at 
late stages in the sequence show that these galaxies may continue to have high star 
formation rates in the future. 

To put more flesh on this argument, the integrated colours of galaxies of different 
Hubble types can be plotted on a (U — B, B — V) colour-colour diagram, the colours 
being corrected for internal and external reddening. Such a colour-colour diagram 
for a sample of galaxies selected from the Hubble Atlas of Galaxies is shown in 
Fig. 3.19 in which it can be seen that the colours of galaxies occupy a remarkably 
narrow region of the (U — B, B — V) plane (Larson and Tinsley, 1978). There is 
a monotonic variation of Hubble types along this locus, the bluest galaxies being 
the Sc and Sd galaxies and the reddest the elliptical galaxies, as can be seen from 
comparison with Fig. 3. 1 8g. The colours of the galaxies cannot be represented by 
those of any single class of star which is hardly surprising since different classes of 
star make the dominant contribution at different wavelengths. 

The integrated light of galaxies is principally the sum of the light of main se- 
quence stars plus red giant stars, in particular, the K and M giants. To a rough 
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Fig. 3.19. The colour-colour 
(U-B, B-V) diagram for the 
integrated colours of galaxies 
of different morphological types 
for galaxies selected from the 
Hubble Atlas of Galaxies (Larson 
and Tinsley, 1978) 



approximation, the colours of galaxies can be represented by the sum of the num- 
bers of luminous blue stars on the main sequence and of luminous giants on the 
giant branch. If all the stars in galaxies formed 10 10 years ago, the main sequence 
termination point would now have reached roughly the mass of the Sun, M ~ Mq, 
and the brightest main- sequence stars would have spectral properties similar to that 
of the Sun, that is, a G2 star. There would therefore be no bright blue stars on the 
main sequence and the integrated light of the galaxy would be dominated by red 
giants. On the other hand, if star formation has continued over 10 10 years, or if 
there were a burst of star formation in the recent past, there would be a significant 
population of hot blue stars on the main sequence giving the galaxy a significantly 
bluer colour. 



3.9 The Red and Blue Sequences 

With the availability of the large samples of galaxies from the SDSS and the 2dF 
Galaxy Survey, a more quantitative approach to the classification of galaxies has 
had to be developed, necessitated by the need to analyse these huge samples by 
computer algorithms. What is lost in detail in these computer-based classifications is 
more than compensated for by the huge statistics of galaxies with different properties. 
The upshot of these studies is that what are traditionally referred to as early and late- 
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type galaxies are found to form two distinct sequences which are known as the red 
and blue sequences. In summary: 

- The red sequence consists of non-star-forming, high mass spheroidal galaxies, 
or, more colloquially ‘old, red and dead’ galaxies. 

- The blue sequence consists of star-forming, low mass galaxies which are disc- 
dominated. 

These two sequences are defined by a number of the characteristic properties which 
have already been introduced. 

3.9.1 Colour Versus Absolute Magnitude 

Perhaps the most striking distinction between the two sequences appears in the 
plot of the colour 01 (g — r) against absolute magnitude M. Figure 3.20a shows 
the distribution of these properties for 144,000 galaxies from the SDSS catalogue 
(Blanton et al., 2003). Superimposed on the diagram are isodensity contours, the 
bulk of the galaxies lying within the heavy white contours. The separation into two 
sequences is clearly defined, the oval region at the top of the diagram being the red 
sequence and the broader region towards the bottom right the blue sequence. 

Baldry and his colleagues have shown that the colour distribution of these galax- 
ies can be separated into red and blue sequences which can be can be very well- 
described by Gaussian distributions over the magnitude range —23.5 < M r < 

- 15.75 (Baldry et al., 2004). It is striking how precisely the overall colour distri- 
bution in each bin of absolute magnitude over this wide magnitude range can be 
decomposed into two Gaussian distributions (Fig. 3.21). The red galaxies are the 
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Fig. 3.20. a Illustrating the bimodality in the distribution of colour 01 (g — r ) of galaxies 
as a function of optical absolute magnitude (Blanton et al., 2003). b A plot of Sersic index 
against colour for 10,095 galaxies selected from the Millennium Galaxy Catalogue (Driver 
et al., 2006) 
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Fig. 3.21. Illustrating the bimodality in the distribution of the colours of galaxies as a function 
of optical absolute magnitude for a sample of 66,848 galaxies selected from the Sloan Digital 
Sky Survey (SDSS). The distributions of colours have been fitted by pairs of Gaussians. 
The data have been binned in intervals of 0.1 in the rest frame ( u — r) colour. The galaxy 
distributions are binned in 0.5 magnitude intervals. Only half of the histograms presented by 
the authors are shown (Baldry et al., 2004) 

most luminous, while the blue galaxies form the dominant population at low abso- 
lute magnitudes, as is reflected in the different luminosity functions for red and blue 
galaxies (Fig. 3.15a). 
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3.9.2 Sersic Index and Colour 

Bimodality is also present in the structural properties of the galaxies. As seen 
in Fig. 3.8a, the Sersic index n can be used to divide galaxies into spheroidal- 
dominated and disc-dominated galaxies and this shows up even more dramatically in 
a plot of colour against Sersic index (Fig. 3.20b) (Driver et al., 2006). As discussed 
in Sect. 3.4.3, the spheroid-dominated systems are most commonly found with 
Sersic parameter n — 4, whereas the disc-dominated systems typically have n = 1 . 
There is a clear separation between these systems in Fig. 3.8a, but it is even more 
pronounced in Fig. 3.20b in which the red and blue sequences occupy quite separate 
regions of the diagram. The dividing line between the two sequences occurs about 
n — 2. 

3.9.3 Mean Stellar Age and Concentration Index C 

Another approach to separating galaxies into two sequences is to use measures of 
the age of their stellar populations and the degree of concentration of the light 
towards their centres. Kauffmann and her colleagues have used sample of 122,808 
galaxies from the SDSS to study the average age of their stellar populations using 
the amplitude of the Balmer break, or discontinuity, at 400 nm, D„(4000), and the 
Balmer absorption line index FF>a. The latter measures the strengths of the Balmer 
absorption line which are particularly strong in galaxies which have undergone 
a recent burst of star formation (Kauffmann et al., 2003). They have shown that 
these indices provide good measures of star formation activity over the last 10 9 and 
(1-10) x 10 9 years respectively. 

The concentration index C is defined to be the ratio C — (R90/R50), where R90 
and R50 are the radii enclosing 90% and 50% of the Petrosian r-band luminosity of 
the galaxy. The concentration parameter C is strongly correlated with Hubble type, 
C — 2.6 separating early from late-type galaxies. Those galaxies with concentration 
indices C > 2.6 are early-type galaxies, reflecting the fact that the light is more 
concentrated towards their centres. 

D„ (4000) and H<5 a are plotted against the concentration index C and the mean 
stellar mass density within the half light radius /r* in Fig. 3.22. The panels of that 
diagram show that the galaxy populations are divided into two distinct sequences. 
Kauffmann and her colleagues show that the dividing line between the two se- 
quences occurs at a stellar mass M » 3 x 10 10 Mg. Lower mass galaxies have 
young stellar populations, low surface mass densities and the low concentration 
indices typical of disks. They infer that a significant fraction of the lowest mass 
galaxies have experienced recent starbursts. For stellar masses M > 3 x 10 10 M 0 , 
the fraction of galaxies with old stellar populations increases rapidly. These also have 
the high surface mass densities and high concentration indices typical of spheroids 
or bulges. 
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Fig. 3.22. Density distributions showing the trends of the stellar age indicators D„(4000) and 
H<5 a with concentration index C = (R90/R50) and surface mass density /x* (Kauffmann 
et al., 2003) 



3.9.4 The Effect of the Galaxy Environment 

The differences in morphological types of galaxies found in different galactic envi- 
ronments has already been illustrated in Figs. 3.4 and 3. 16. Another way of presenting 
these data emphasising the distinction between the galaxies in the red and blue se- 
quences was carried out by Hogg and his colleagues (Hogg et ah, 2004). The sample 
consisted of 55,158 galaxies in the redshift interval 0.08 < z < 0.12. The local 
galaxy density about any given galaxy was defined by the quantity <$i x g, meaning 
the overdensity about any galaxy in a cylindrical volume with transverse comoving 
radius 1 h 1 Mpc and comoving half-length along the line of sight of 8 h 1 Mpc. 
Thus, a galaxy in an environment with the average density of galaxies has <$i x g = 0. 
Values of 5i x g > 50 are found in the cores of rich clusters. 

The top row of Fig. 3.23 shows contour plots of the number density of galaxies 
in the colour-absolute magnitude diagram of Fig. 3.21a, but now shown separately 
for different overdensity environments, ranging from low excess number densities, 
<5ix8 < 3, to very high density environments <5i x g > 50. These data quantify the 
statement that red galaxies are found preferentially in rich galaxy environments. The 
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Fig. 3.23. Illustrating the bimodality in the distribution of the colours of galaxies as a function 
of the density of galaxies in which the galaxy is observed and as a function of their structures 
as parameterised by the Sersic index n (Hogg et al., 2004) 



second and third rows further split the sample of galaxies into those with Sersic 
parameters greater and less than 2. These diagrams quantify the statement that red 
spheroidal galaxies are found in the richest cluster regions and these are avoided by 
the blue disc-like galaxies. 

3.9.5 The New Perspective 

The new way of quantifying the physical properties of galaxies developed in this 
section illustrates a profound difference of approach to the study of galaxies as 
compared with, say, ten years ago when the first edition of this book was written. The 
advent of the huge galaxy surveys as represented by the SDSS and 2dF galaxy surveys 
have provided the opportunity to quantify by computer algorithm the properties of 
galaxies which in the past relied somewhat upon the eye of the experienced observer. 
The division of galaxies into members of the blue and red sequences parallels in 
many ways the division into early and late-type galaxies. To a good approximation, 
galaxies earlier than Sa in the Hubble sequence, stage T = 1 in de Vaucouleurs’ 
classification (Table 3.1), are members of the red sequence and galaxies later than 
T — 1 belong to the blue sequence. 
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Of particular importance is the fact that the relative number densities of galaxies 
of different types are now well-established with large statistics and so are ripe for 
comparison with the predictions of theories of galaxy formation. As an example of 
the usefulness of the new statistics, an important result is how the average number, 
luminosity and mass densities of the stellar component of galaxies in the local 
Universe are made up. Bell and his colleagues have shown, for example, that while 
the red sequence contains only 20% of the galaxies by number, these contribute 40% 
of the stellar luminosity density and 60% of the average stellar mass density at the 
present epoch (Bell et al., 2003). 



3.10 Concluding Remark 

This exposition has focussed upon understanding the properties of galaxies at the 
present epoch and has refrained from consideration of the vast amount of data now 
available on samples of galaxies at earlier epochs, or large redshifts, which provide 
clues to their origin and evolution. These topics will be taken up in much more detail 
in Chaps. 17 to 19 once the origin of large-scale structures in the Universe has been 
established. 



4 Clusters of Galaxies 



Associations of galaxies range from pairs and small groups, through the giant clusters 
containing over a thousand galaxies, to the vast structures on scales much greater 
than clusters such as the vast ‘walls’ seen in Figs. 2.7 and 2.8. Clustering occurs 
on all scales, as is demonstrated by the two-point correlation function for galaxies 
(Figs. 2.5 and 2.6). Few galaxies can be considered truly isolated. Rich clusters 
of galaxies are of particular interest because they are the largest gravitationally 
bound systems we know of in the Universe. They possess correspondingly deep 
gravitational potential wells which can be observed through the bremsstrahlung X- 
ray emission of hot gas which forms an atmosphere within the cluster. The hot gas can 
also be detected through the decrements which it causes in the Cosmic Microwave 
Background Radiation as a result of the Sunyaev-Zeldovich effect. 

Clusters, therefore, provide laboratories for studying many different aspects of 
galactic evolution within rather well-defined astrophysical environments. Interac- 
tions of galaxies with each other and with the intergalactic medium in the cluster can 
be studied, as well as the distribution and nature of the dark matter, which dominates 
their dynamics. Radio source events can strongly perturb the distribution of hot gas. 
From the perspective of the formation of large-scale structure, the mass function for 
clusters of galaxies provides constraints on the development of structure on large 
scales and on cosmological parameters. 



4.1 The Large-Scale Distribution of Clusters of Galaxies 

Until relatively recently, the surveys of rich clusters of galaxies which have been the 
focus of most attention resulted from the pioneering efforts of George Abell. More 
recently, clusters have been detected by analysing the distribution of galaxies found 
in machine-scanned surveys of 48-inch Schmidt telescope plates, such as the APM 
and COSMOS cluster surveys. Most recently, rich clusters have been identified in 
the large catalogues of galaxies provided by the Sloan Digital Sky Survey. Another 
approach is to identify clusters of galaxies as extended X-ray sources at high galactic 
latitudes and this has proved to be an effective procedure which is independent of 
the need to identify the individual cluster members (Sect. 4.4). 
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4.1.1 The Abell Catalogues of Rich Clusters of Galaxies 

It is worthwhile recounting the heroic efforts of Abell and his colleagues in creating 
the Abell catalogue of rich clusters of galaxies and its southern counterpart. The 
48-inch Schmidt telescope was constructed on Palomar mountain during the late 
1940s as a wide- field telescope to support and complement observations made with 
the 200-inch telescope. The Palomar Sky Survey took seven years to complete 
and comprised 879 pairs of 14-inch plates, each providing roughly 6° x 6° sky 
coverage and having limiting magnitudes of 21.1 in the blue and 20.0 in the red 
wavebands. Abell was one of the principal observers for the Palomar Sky Survey 
and, while the plates were being taken, he systematically catalogued the rich clusters 
of galaxies appearing on the plates. The word ‘rich’ meant that there was no doubt 
as to the reality of the associations of galaxies appearing on the plates (Abell, 1958). 
A typical example of a rich, regular cluster of galaxies, Abell 2218, is shown in 
Fig. 4.1. A corresponding catalogue for the southern hemisphere was created with 
the completion of the ESO-SERC Southern Sky Survey, which was made with the 
48-inch UK Schmidt Telescope at the Siding Spring Observatory in New South Wales 
(Abell et al., 1989). In both cases, the clusters were found by visual inspection of 
the Sky Survey plates. 

Crucial to the success of Abell’s programme was adherence to the strict set of 
criteria he established for the inclusion of clusters in the catalogue. The 4073 clusters 
in the combined northern and southern catalogue of Abell, Corwin and Olowin fulfil 
the following selection criteria: 

- Richness criterion. The clusters must have 50 members brighter than 2 magni- 
tudes (m3 + 2) fainter than the third brightest member (m3). Richness classes 
are defined by the number of galaxies with magnitudes between m 3 and m3 + 2, 




Fig. 4.1. The rich, regular cluster of galaxies Abell 2218. Abell classified this cluster as 
richness class 4. The central galaxy is a supergiant or cD galaxy, which is very much brighter 
than all the other galaxies in the cluster. It is located close to the dynamical centre of the 
cluster. The image also shows a number of arcs which are the gravitationally tensed images 
of very distant background galaxies (Courtesy NASA, ESA and the Space Telescope Science 
Institute) 
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as described in Table 4. 1 . A richness class 0 is included for clusters which have 
between 30 and 50 members within the same magnitude interval, but the cata- 
logues are not complete for this class. Studies of a number of nearby clusters 
of different richness classes have shown that richness is proportional to the total 
number of galaxies in the cluster. 

- Compactness criterion. Galaxies are only counted within a radius of 1 .5 h~ 1 Mpc 
of the cluster centre. This radius corresponds to an angular radius of 1 .7 /z arcmin, 
the redshift of the cluster being estimated from the apparent magnitude of its 
tenth brightest member, m io- 

- Distance criteria. Abell clusters with redshifts less than 0.02 span more than 
one Sky Survey plate and hence this lower redshift limit was adopted. The upper 
redshift limit is set by the fact that galaxies could not be counted to magnitudes 
fainter than m r — 20 in the northern survey and therefore the third brightest 
galaxy must be brighter than m 3 — 17.5. This redshift limit corresponds to 
z ~ 0.2. From redshift measurements of a number of clusters, it was found that 
m 10 , the apparent magnitude of the tenth brightest cluster member, is a reliable 
distance indicator. The clusters were then divided into distance classes in such 
a way that there was a small probability of the class assigned to the cluster being 
more than one class wrong. 

Within the northern sample, Abell defined a complete statistical sample of 1682 
clusters which fulfilled distance criteria 1 to 6 and richness criteria 1 to 5. The 
numbers of clusters in each richness class in this sample and the total numbers in the 
southern sample are included in Table 4.1. The number density distribution of clusters 
with increasing distance, equivalent to the number counts of clusters, is shown in 
Fig. 4.2 (Batuski et al., 1989). It can be seen that serious incompleteness sets in at 
distances greater than about 600/?^* Mpc, corresponding to redshift z = 0.2, but 
at smaller distances, there is reasonable agreement between the samples of clusters 



Table 4.1. Definitions of the richness classes R of Abell clusters and the numbers of clusters 
within Abell’s complete northern sample of 1682 clusters and the total sample of southern 
clusters. N is the number of galaxies in the cluster between magnitudes m 3 and m 3 + 2 (Abell, 
1958; Bahcall, 1988; Abell et al., 1989) 



Richness 
Class R 


N 


Number of clusters 
in the complete 
northern sample 


Total number of 
clusters in the 
southern sample 


( 0 Y 


(30 - 49) 


(> 10 3 ) 


(664) 


1 


50-79 


1224 


656 


2 


80 - 129 


383 


273 


3 


130- 199 


68 


41 


4 


200 - 299 


6 


1 


5 


300 or more 


1 


0 



The sample is not complete for richness class zero 



Cluster density (h 3 Mpc' 3 ) 
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Fig. 4.2. The average estimated space densities of Abell clusters in the Northern and Southern 
catalogues for richness classes R > 1 as a function of estimated distance D c (Batuski et al., 
1989) 



in the northern and southern hemispheres. The space density of Abell clusters with 
richness classes greater than or equal to 1 is, to a good approximation, 

N C \(R > 1) ^ 10 -5 /z 3 Mpc' 3 , (4.1) 

so that the typical distance between cluster centres, if they were uniformly dis- 
tributed in space, would be ~ 50 h 1 Mpc. These figures can be compared with the 
space density of ‘mean galaxies’ of 10 2 /? 3 Mpc~ 3 and their typical separations of 
5/r 1 Mpc (see Sect. 3.7.2). 

4.1.2 Comparison with Clusters Selected from the Sloan Digital Sky Survey 

The studies of Abell and his colleagues were entirely based upon visual inspec- 
tion of the plates and film copies of the Northern and Southern 48-inch Schmidt 
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Telescope Sky Surveys. Great care was taken to ensure the consistency of the clus- 
ter identifications and classifications over many years. The advent of high speed 
measuring machines with powerful procedures for star-galaxy separation enabled 
more objective approaches to the identification of rich clusters to be developed. It 
is perhaps most instructive to compare Abell’s results with those of the Bahcall and 
her colleagues who have identified the clusters present in the early release com- 
missioning data of the Sloan Digital Sky Survey (SDSS) (Bahcall et al., 2003b). 
The SDSS has the great advantages of having images in five different wavebands as 
well as redshifts for very large samples of galaxies at z < 0.1. As a result, velocity 
dispersions are available for many of the clusters studied, enabling masses to be 
estimated. 

The analyses described by Bahcall and her colleagues do not use all these data but 
employ two of the various algorithms available to isolate clusters of galaxies which 
satisfy criteria similar to Abell’s, but which extend to slightly lower richness classes. 
They find that 53 of the 58 Abell clusters in the area of sky studied are included in 
their merged catalogue of clusters, the remaining five clusters being detected just 
below the richness limits of their catalogue. This is a remarkable testament to the 
quality of the visual approach of Abell and his colleagues. 

An advantage of the procedures adopted by Bahcall and her colleagues is that 
the limits of the catalogue can be expressed in terms of the velocity dispersions 
and masses of the clusters. Thus, the threshold for the inclusion of clusters in 
the catalogue corresponds to typical velocity dispersions o v > 400 km s -1 and to 
masses M within radius 0.6 h 1 Mpc of their centres of M > 5 x 1 0 1 3 /i 1 M 0 . This 
threshold corresponds to richness classes below Abell’s richness class 0. The average 
space density of these rich clusters is 2 x 10 5 /7 3 Mpc -3 , in excellent agreement 
with Abell’s estimate when account is taken of the fact that the sample extends to 
somewhat lower richness limits. 

Bahcall and her colleagues provide a useful table showing the velocity disper- 
sions, masses and luminosities for the samples of clusters of different richnesses in 
their catalogue (Table 4.2). These data provide estimates of the mass-to-luminosity 
ratios for clusters of different richnesses. It can be seen from Table 4.2 that these 
range from ~ 1 70 h to ~ 250 h with increasing richness. In turn, these data can be 
used to determine the mass functions of rich clusters of galaxies and to provide con- 
straints on the mass density parameter f2o and the amplitude of the mass fluctuations 
os (Sects. 8.7 and 14.4). 

4.1.3 Abell Clusters and the Large-Scale Distribution of Galaxies 

Abell clusters are strongly correlated in space, both with each other and with the 
distribution of galaxies in general. These associations were originally described in 
terms of the superclustering of galaxies, but it is preferable nowadays to express 
the clustering in terms of the correlation functions introduced in Sect. 2.2.1. It is 
simplest to quote the results of the SDSS survey of rich clusters by Bahcall and her 
colleagues who include extensive references to earlier work (Bahcall et al., 2003a). 
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Table 4.2. An example of the scaling relations for clusters of galaxies of different richnesses 
from analysis of the commissioning data from the SDSS (Bahcall et al., 2003b). /V ga i is the 
number of red E/SO galaxies one magnitude fainter than L* in the i band within a radius 
of 1 h~ l Mpc of the brightest cluster galaxy where L* = 10 10 h 2 Lq. A is a measure of the 
richness of the cluster and is derived from the best-fit cluster model which has total luminosity 
L c i = AL * , where the luminosity is measured within a radius of 1 h~ l Mpc 
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The correlation functions for the different richness classes described in Table 4.2 
are shown in Fig. 4.3. The best-fit two-point correlation function has the form 

n(r) dF = «o [1 + £(/)] dV , where £(r) = ^ — j , (4.2) 

which is slightly steeper than the value found on smaller physical scales for galaxies 
in general. The data included in Fig. 4.3 show that the correlation length is 
much greater than that for galaxies in general. Bahcall and her colleagues find 
that the variation can be described by the relation ro = 2.6 \fd for (20 < d < 
90 )h~ l Mpc, where d is the mean comoving separation distance of the clusters 
belonging to different richness classes. Thus, the correlation lengths for these rich 
clusters are large, on the scale of the huge voids seen in the distribution of galaxies 
in general. 

The rich clusters are generally found in the densest regions in the ‘cosmic 
web’ seen in Figs. 2.7 and 2.8. Some measure of the association of Abell clusters 
with galaxies in general has been provided by the analysis of Seldner and Peebles 
(Seldner and Peebles, 1977). They cross-correlated the counts of galaxies in the 
Shane- Wirtanen catalogue, which extended to apparent magnitude m ~ 1 9, with 
the positions of Abell clusters of distance class 5. The cross-correlation function was 
found to be of the form 

§ gc = Ar~ 2A , 

out at a distance of 15/z 1 Mpc from the cluster centres. This function represents 
the correlation of the Abell clusters with galaxies in the vicinity of the cluster, but 
outside what would normally be considered to be the cluster boundary; Seldner and 
Peebles refer to this phenomenon as the superclustering of galaxies about Abell 
clusters. This result is in agreement with the visual impression of the distribution of 
galaxies seen in Figs. 2.7 and 2.8. 
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Fig. 4.3. The two-point spatial correlation function for four richness thresholds (_/V ga i > 10, 
> 13, > 15, > 20) for clusters selected from the SDSS (Bahcall et al., 2003a). Best-fit 
functions with slope 2 and correlation-scale ro are shown by the dashed lines. The error bars 
show the 1 a uncertainties in the estimates. The values of d for the four panels are 26.2, 35.6, 
41.5 and 58.1 Mpc with increasing richness 



4.2 The Distribution of Galaxies in Clusters of Galaxies 

Clusters of galaxies come in a variety of shapes and forms and various schemes have 
been developed to put some order into this diversity. Just as in the case of galaxies, 
modern computer-based systems of classification bring new, quantitative insights 
into the wealth of detail contained in the visual classification of clusters. 

4.2.1 The Galaxy Content and Spatial Distribution of Galaxies in Clusters 

Abell classified clusters as regular if they are more or less circularly symmetrical with 
a central concentration, similar in structure to globular clusters (Abell, 1962). These 
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are among the richest clusters, generally containing over 1000 members. Examples 
include the Coma cluster. The galaxies are predominantly elliptical and SO galaxies 
as has been indicated by the results shown in Figs. 3.4 and 3.16. Abell called all the 
other clusters in which there is much less well-defined structure irregular clusters. 
In his words, the irregular clusters 

. . . have a more nearly amorphous appearance, and possess little or no 
spherical symmetry or central concentration. The irregular clusters some- 
times, however, have several small subcondensations or nuclei and resemble 
loose swarms of small clusters. 

Examples of the latter include the Hercules and Virgo clusters. 

Oemler studied systematically a representative sample of fifteen Abell clusters of 
different richness classes and morphological types and distinguished three principal 
types of cluster according to their galaxy content (Oemler, 1974): 

- cD clusters have a unique and dominant cD galaxy and the ratio of elliptical (E) 
to lenticular (SO) to spiral (S) galaxies is roughly 3:4:2, that is, only about 
20% are spirals. These clusters are very rich in E and SO galaxies. 

- Spiral-rich clusters have galaxy-type ratios E : SO : S roughly 1:2:3, that is, 
about 50% spirals, a distribution similar to the proportions of types found in the 
general field. 

- The remaining clusters are called spiral-poor clusters. They have no dominant 
cD galaxy and have galaxy type ratios E : SO : S roughly 1:2:1. 

Abell noted that there is a correlation between the structure of clusters and galaxy 
content and this result was quantified by Oemler who established the following 
relations: 

- In cD clusters or regular clusters, the spatial distribution of galaxies resembles 
the distribution of stars in a globular cluster (see Sect. 4.2.2). The space density 
of galaxies increases rapidly towards the centre of the cluster. The spiral-rich 
clusters and irregular clusters are not symmetric and there is little central con- 
centration; the spatial density of galaxies is roughly uniform towards the central 
regions and is lower than that in cD clusters. The spiral-poor clusters are inter- 
mediate between these two extremes. 

- In the case of spiral-rich clusters, the radial distribution of elliptical, lenticular 
and spiral galaxies is more or less the same. In cD and spiral-poor clusters, 
however, the relative space density of spiral galaxies decreases markedly towards 
the central regions, reflecting Dressler’s correlation of galaxy type with galaxy 
number density (Dressier, 1984). In these clusters, the spiral galaxies form a halo 
around a central core of elliptical and SO galaxies. There is therefore segregation 
by galaxy type in cD and spiral-poor clusters. 

- In addition to evidence for segregation by galaxy type, there is some evidence 
for mass segregation as a function of cluster type. Adopting apparent magnitude 
as a measure of mass, Oemler found that, in cD and spiral-poor clusters, the 
massive galaxies are located closer to the centre than less massive galaxies. 
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However, this mass segregation is only important for the brightest members of 
clusters, m < m \ +2, where in \ is the apparent magnitude of the brightest galaxy; 
galaxies fainter than m i + 2 appear to be uniformly distributed throughout the 
clusters. No such mass segregation is found in spiral-rich clusters. 

As discussed in Sects. 3.7.1 and 4.2.3, the cD galaxies found in the centres of 
cD clusters have a number of features which distinguish them from giant elliptical 
galaxies (Kormendy, 1982). Their most distinctive features are their extensive stellar 
envelopes which can be as large as 100 kpc. In addition to being found in rich clusters, 
examples of cD galaxies are known in small groups of galaxies but, in all cases, they 
are found in regions of significantly enhanced galaxy density relative to the general 
field. Dressier found that cD galaxies are only found in regions in which the galaxy 
density exceeds 1 h 3 galaxy M pc 3 , compared with an average galaxy density of 
10 2 /i 3 Mpc~ 3 (Dressier, 1984). According to Dressier, the local galaxy density 
rather than the richness of the cluster determines whether or not a cD galaxy is 
present in a group or cluster. Some of the central cD galaxies in rich clusters are 
also distinctive in having multiple nuclei which are found in about 25-50% of all 
cD galaxies. In contrast, multiple nuclei are an order of magnitude less common in 
second and third ranked cluster members. 

These results are important in understanding the dynamical evolution of clus- 
ters. Regular, cD clusters are systems which have had time to relax to dynamical 
equilibrium, whereas the other systems are still in the process of relaxation. 

4.2.2 Clusters of Galaxies and Isothermal Gas Spheres 

In regular clusters, the space density of galaxies increases towards the central regions, 
which are referred to as the cores of the clusters. Outside the core, the space density 
of galaxies decreases steadily until it disappears into the background of unrelated 
objects. The regular structures of these clusters suggests that they have relaxed to 
a stationary dynamical state similar to that found in the distribution of stars in globular 
clusters. The spatial distribution of galaxies in a regular cluster can be modelled by 
the distribution of mass in an isothermal gas sphere. These distributions are important 
in the discussion which follows and so let us derive the relevant expressions for the 
density distribution of an isothermal gas sphere. 

The term isothermal means that the temperature, or mean kinetic energy of the 
particles, is constant throughout the cluster. In physical terms, this means that the 
velocity distribution of the galaxies is Maxwellian with the same velocity dispersion 
(or temperature) throughout the cluster. If all the galaxies had the same mass, the 
velocity dispersion would be the same at all locations within the cluster. This is 
a rather sweeping approximation since it would mean that there had been enough time 
for the galaxies to have exchanged kinetic energy and come to a thermal equilibrium 
velocity distribution. Although a good case can be made that the galaxies have had 
time to ‘virialise’, that is, to satisfy the virial theorem when the cluster formed, it 
takes much longer for energy exchange to take place, except for the most massive 
galaxies in the cluster. Nonetheless, let us work out the density distribution of an 
isothermal gas sphere as a reference model for comparison with the observations. 
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We need the Lane—Emden equation , which describes the structure of a spherically 
symmetric object, such as a star, in hydrostatic equilibrium. The requirement of 
hydrostatic equilibrium is that, at all points in the system, the attractive gravitational 
force acting on a mass element g dV at radial distance r from the centre of the system 
is balanced by the pressure gradient at that point, 



dp GMq 

dr r 2 



where M is the mass contained within radius r, 



M = f 4jtr 2 g(r) dr d M — 4;r r 2 g(r) dr . 

Jo 

Reordering (4.3) and differentiating, we find 



r 2 dp d ( r 2 d/A d M 

— = —GM — — )=-G 

g dr dr \ g dr / dr 



d 

dr 



(Sd p\ 

V Q d r) 



+ 4 7iGr 2 g = 0 . 



(4.3) 



(4.4) 



(4.5) 



Equation (4.5) is the Lane-Emden equation. We are interested in the case in which 
the pressure p and the density g are related by the perfect gas law at all radii 
r, p = gkT/p,, where /x is the mass of an atom, molecule or galaxy. In thermal 
equilibrium, | kT = \n{ir), where (v 2 ) is the mean square velocity of the atoms, 
molecules or galaxies. Therefore, substituting for p. 
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dr 



r 2 dg\ 

g dr] 



+ 



4itGg 

kT 



r 2 g = 0 . 



(4.6) 



Equation (4.6) is a non-linear differential equation and, in general, must be solved 
numerically. There is, however, an analytic solution for large values of r. If gir) is 
expressed as a power series in r, gir) = A A„r~", there is a solution for large r 
with n — 2, 



2 ( 4ttGu\ 

6{r) = AA WherC A = ' (4 ' 7) 

This mass distribution has the unfortunate property that the total mass of the cluster 
diverges at large values of r, 

f OO poo 

I 4 Ttr 2 g(r) dr = / — dr — > oo . (4.8) 

Jo Jo ^ 

There are, however, at least two reasons why there should be a cut-off at large 
radii. First of all, at very large distances, the particle densities become so low that the 
mean free path between collisions is very long. The thermalisation time-scales con- 
sequently become greater than the time-scale of the system. The radius at which this 
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occurs is known as Smoluchow ski’s envelope. Secondly, in astrophysical systems, 
the outermost stars or galaxies are stripped from the system by tidal interactions 
with neighbouring systems. This process defines a tidal radius r, for the cluster. 
Therefore, if clusters are modelled by isothermal gas spheres, it is perfectly permis- 
sible to introduce a cut-off at some suitably large radius, resulting in a finite total 
mass. 

It is convenient to rewrite (4.6) in dimensionless form by writing q = Qoy, where 
go is the central mass density, and introducing a structural index or structural length 
a, where a is defined by the relation 

« = tt \ 1/2 • (4.9) 

(Ag 0 ) 1/2 

Distances from the centre can then be measured in terms of the dimensionless 
distance x = r/a. Then, (4.6) becomes 

d 
dx 



> d(log v) 

dx 



+ x"y = 0 . 



(4.10) 



Two versions of the solution of (4.10) are listed in Table 4.3 and illustrated in 
Fig. 4.4. In column 2, the solution of y as a function of distance x is given; in the 
third column, the projected distribution onto a plane is given, this being the observed 
distribution of a cluster of stars or galaxies on the sky. It is a simple calculation to 
show that, if q is the projected distance from the centre of the cluster, the surface 
density N(q) is related to y(x) by the integral 

= < 4 ‘» 

Inspection of Table 4.3 and Fig. 4.4, shows that a is a measure of the size of 
the core of the cluster. It is convenient to fit the projected distribution N(q) to the 
distribution of stars or galaxies in a cluster and then a core radius for the cluster can 
be defined. It can be seen that the projected density falls to the value N(q) = 1/2 at 
q = 3, that is, at a core radius R \/2 = 3a. R \/2 is a convenient measure of the core 
radius of the cluster. 

Having measured R 1 / 2 , the central mass density of the cluster can be found if 
the velocity dispersion of the galaxies in this region is also known. From Maxwell’s 
equipartition theorem, \pt{ v 2 ) = %kT and therefore, from the definition of a, 



1 _ kT 

Ago AjzGgLQo 



\2nGQQ 



(4.12) 



Observationally, we can only measure the radial component of the galaxies’ velocities 
V|| . Assuming the velocity distribution of the galaxies in the cluster is isotropic, 
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Fig. 4.4. The density distribution yfr) and the projected density distribution N(q) for an 
isothermal gas sphere 



Table 4.3. The density distribution y(x) and the projected density distribution N(q) for an 
isothermal gas sphere 



x, q 


y(x) 


N(q) 


x, q 


y(x) 


N(q) 


0 


1.0 


1.0 


12 


0.0151 


0.0839 


0.5 


0.9597 


0.9782 


14 


0.0104 


0.0694 


1.0 


0.8529 


0.9013 


16 


0.0075 


0.0591 


1.5 


0.7129 


0.8025 


20 


0.0045 


0.0457 


2 


0.5714 


0.6955 


30 


0.0019 


0.0313 


3 


0.3454 


0.5033 


40 


0.0010 


0.0229 


4 


0.2079 


0.3643 


50 


0.0007 


0.0188 


5 


0.1297 


0.2748 


100 


1.75 x 10" 4 


0.0101 


6 


0.0849 


0.2143 


200 


5.08 x 10“ 5 


0.0053 


7 


0.0583 


0.1724 


300 


2.32 x 10“ 5 


0.0036 


8 


0.0418 


0.1420 


500 


8.40 x 10“ 6 


0.0021 


9 


0.0311 


0.1209 


1000 


2.0 x 10“ 6 


0.0010 


10 


0.0238 


0.1050 









Expressing the central density qq in terms of R\/2 and (pjj), we find 
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(4.14) 
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Thus, assuming the central density distribution of a cluster can be represented by 
an isothermal gas sphere, we can find the central mass density of the cluster by 
measuring (i)m> and R\/ 2 - 

Improved versions of the isothermal sphere model were evaluated by King and 
these have provided good fits to the light distributions of globular clusters and 
galaxies (King, 1966, 1981). The models were originally derived from studies of the 
solutions of the Fokker-Planck equation which describes the distribution function 
f(v, r) for the stars in a cluster under the condition that there should be no particles 
present with velocities which enable them to escape from the cluster. This might 
occur for two reasons. Either the stars have velocities which exceed the escape 
velocity from the cluster, or the stars travel to distances greater than the tidal radius 
of the cluster when they are lost from the cluster because of tidal forces. In either 
case, the cluster can be modelled as a truncated isothermal gas sphere in which none 
of the stars can have velocities exceeding some value v e . This is implemented by 
truncating the Maxwell velocity distribution at this velocity which in turn results in 
models with finite tidal radii r t . The luminosity profiles for such clusters are shown 
in Fig. 4.5, the models being parameterised by the quantity log r t /r c , the logarithm 
of the ratio of the tidal and core radii. In the limit r t /r c -> oo, the models become 
isothermal gas spheres. 

The models which best represent elliptical galaxies are not too dissimilar from 
isothermal gas spheres. In terms of King’s models, the giant elliptical galaxies 




Fig. 4.5. King models for the distribution of stars in globular clusters, galaxies or of galxies 
in clusters of galaxies (King, 1966, 1981 ). The curves show the projected distribution of stars 
or galaxies, equivalent to N(q) in Table 4.3, and are parameterised by the quantity log(r t /r c ) 
where r t is the tidal radius and r c the core radius. The arrows indicate log r t 
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have log i\/r c ~ 2.2; for dwarf elliptical galaxies, the surface brightness decreases 
somewhat more rapidly in the outer regions and King models having small values 
of log r t /r c can give a good representation of the data. These profiles have also been 
found to provide a good description of the distribution of galaxies in clusters. 

According to Bahcall, the observed distribution of galaxies in regular clusters 
can be described by truncated isothermal distributions N(r ) of the form 



N(r) = No[f{r) - C] , 



(4.15) 



where f(r) is the projected isothermal distribution normalised to fir) — 1 at r = 0 
and C is a constant which reduces the value of N(r ) to zero at some radius R/ t 
such that /(/?;,) = C (Bahcall, 1977). For regular clusters core radii lie in the range 
R | /2 = 150 — 400 kpc, the Coma cluster having R \ /2 = 220 kpc. Bahcall found 
that there is a relatively small dispersion in the values of C required to provide 
a satisfactory fit to the profiles of many regular clusters, typically the value of C 
corresponding to about 1.5% of the isothermal central density. 

Many different density distributions have been proposed to describe the space 
density distribution of galaxies in clusters in addition to the above examples. These 
include models such as de Vaucouleurs’ law for elliptical galaxies (equations 3.2 
or 3.3) as well other possibilities such as the Plummer model which is derived from 
a gravitational potential with a core radius b of the form 



GM 

(r 2 + Z? 2 ) 1 / 2 



(4.16) 



where M is the total mass of the system. Using Poisson’s law for gravity in spherical 
polar coordinates, we find the density distribution from 



V 2 0 




= 4t tGq . 



(4.17) 



Then, 



Q(r) = 



3 M 
4-jtb 3 




(4.18) 



Binney and Tremaine discuss these and other possibilities (Binney and Tremaine, 
1987). 



4.2.3 The Luminosity Function for Cluster Galaxies 

The luminosity functions for cluster galaxies can be represented by the Schechter 
function introduced in Sect. 3.7, but, as indicated by Fig. 3.16, there are significant 
differences in the shapes of the luminosity functions for early and late-type galaxies 
as compared with the general field, or the even more extreme case of the void regions. 
These functions are another way of presenting Dressler’s results concerning the 
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different types of galactic populations found in environments of different densities 
(Fig. 3.4). 

In Sandage’s analysis of the redshift-apparent magnitude relation, he demon- 
strated that there is remarkably little scatter in the absolute magnitudes of the brightest 
members of rich clusters of galaxies, the standard deviation of the brightest clus- 
ter galaxies amounting to a M = 0.28 (Sandage, 1988) (see Fig. 2.11). There was 
some controversy about whether or not the constancy of the absolute magnitude of 
the first-ranked cluster galaxy could be explained by randomly sampling the high 
luminosity region of the luminosity function, or whether they possess some special 
property which is independent of the cluster richness. 

Tremaine and Richstone compared the dispersion in absolute magnitudes of the 
first-ranked members o( M \ ) with the mean value of the difference in magnitude 
between the first and second ranked members AM 12 = ( M\ — Mfi) (Tremaine 
and Richstone, 1977). They showed that for any statistical luminosity function 
a( /V/ 1 ) / A M 1 2 = fi > 1 must hold. For example, using Schechter’s function, 
t\ — 1.20. For Sandage’s data on rich clusters, t\ — 0.48 ±0.10, supporting his point 
of view that there is much less dispersion in the absolute magnitudes of first ranking 
cluster galaxies than would be expected if they were simply randomly sampled from 
the luminosity function. Geller and Postman failed to confirm this result (Geller and 
Postman, 1983). Nonetheless, as discussed in Sect. 4.2.1, the cD galaxies are quite 
distinct from normal giant elliptical galaxies (Kormendy, 1982). 

A further classification scheme, due to Bautz and Morgan also bears upon the 
issue of the origins of the brightest galaxies in clusters (Bautz and Morgan, 1970). 
In the Bautz-Morgan classification scheme, clusters are classified according to the 
presence or absence of a dominant D or cD galaxy at the centre of the cluster. 
Bautz-Morgan class I clusters contain a dominant centrally located cD galaxy (for 
example Abell 2199); class II clusters have a central galaxy intermediate between 
a cD and giant elliptical galaxy (for example, the Coma cluster); class III clusters 
have no dominant central galaxy. Intermediate classes between classes I, II and III 
have been defined. A number of properties of clusters depend upon Bautz-Morgan 
class. For example, Sandage has found that there is a weak correlation between 
Bautz-Morgan class and the absolute magnitude of the brightest cluster member 
which cannot account for all the difference between the Bautz-Morgan classes. 
There must in addition be an inverse correlation between Bautz-Morgan class and 
the absolute magnitudes of the second and third brightest members in the sense that 
they are relatively brighter in Bautz-Morgan class III clusters. This phenomenon is 
illustrated by the data in Table 4.4 which lists the mean absolute magnitudes of the 
first, second and third brightest galaxies in a large sample of rich clusters studied by 
Sandage and Hardy (Sandage and Hardy, 1973). 

4.2.4 Summary of the Properties of Rich Clusters of Galaxies 

There are some clear trends in the overall properties of clusters of galaxies. A se- 
quence of types can be defined based on Abell’s distinction between regular and 
irregular clusters with the addition of an intermediate class. Table 4.5 is adapted 
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Table 4.4. The absolute magnitudes of the first, second and third ranked cluster galaxies as 
a function of Bautz-Morgan type (Sandage 1976). It is assumed that h = 0.5 



Bautz-Morgan 

Class 


Mi 


m 2 


m 3 


I 


-23.09 ±0.051 


-22.72 ± 0.077 


-22.47 ±0.091 


I-II 


-23.30 ±0.068 


-22.83 ± 0.091 


-22.56 ±0.121 


II 


-23.37 ±0.130 


-22.60 ±0.217 


-22.34 ±0.212 


II-III 


-23.46 ±0.092 


-22.18 ±0.247 


-21.96 ±0.233 


III 


-23.68 ±0.102 


-22.22 ±0.157 


-21.82 ±0.187 



Table 4.5. A summary of the typical properties of rich cluster of galaxies of different types 
(Bahcall, 1977) 



Property/Class Regular Intermediate Irregular 



Bautz-Morgan type 
Galaxy content 
E : SO : S ratio 
Symmetry 

Central concentration 
Central profile 
Mass segregation 

Examples 



I, I II, II 

Elliptical/SO-rich 

3:4:2 

Spherical 

High 

Steep gradient 
Marginal evidence 
for m — m( 1) < 2 
Abell 2199, Coma 



(II), II-III 
Spiral-poor 
1:4:2 
Intermediate 
Moderate 
Intermediate 
Marginal evidence 
for m — m( 1) < 2 
Abell 194, 539 



(II-III), III 
Spiral-rich 
1:2:3 

Irregular shape 
Very little 
Flat gradient 
No segregation 

Virgo, Abell 1228 



from Bahcall’s review of 1977 and summarises many of the properties described 
above as a function of cluster type (Bahcall, 1977). Like the Hubble sequence for 
galaxies, these types are only part of a continuous sequence and there is considerable 
overlap in some of the properties. 



4.3 Dynamical Estimates of the Masses of Clusters of Galaxies 

It might seem that the measurement of the masses of clusters of galaxies is rela- 
tively straightforward. The virial theorem (3.20) provides a simple relation between 
the mass of the cluster, the radial velocity dispersion of the galaxies (iV } and the 
characteristic radius R c \ of the galaxy distribution. The problems arise in ensuring 
that the galaxies really belong to the cluster and are not random coincidences of 
foreground or background objects and also that large enough samples of radial ve- 
locities are available. These problems are now being addressed by projects such as 
the Sloan Digital Sky Survey in which radial velocities, colours and structural data 
are available for large samples of clusters. The results presented in Table 4.2 give 
some impression of the wealth of data which will become available for many clusters 
over the coming years. 
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Fig. 4.6. An optical image of the central region of the Coma cluster of galaxies, Abell 1656, 
showing the two central massive galaxies NGC 4889 and NGC 4874. North is to the right of 
this image. (Courtesy of Dr. Robert Lupton and the SDSS consortium) 



As a case study, let us consider the Coma cluster (Fig. 4.6) which has always 
been regarded as the archetype of a rich regular cluster of galaxies; we will find 
that this is an overoptimistic assertion. The surface density distribution of galaxies 
in the Coma cluster and the variation of their velocity dispersion with radius were 
determined in a classic paper by Kent and Gunn who assembled radial velocities for 
about 300 cluster members (Kent and Gunn, 1982). The projected surface density 
of galaxies increases smoothly towards the centre and can be described by a King 
profile with tidal radius i\ — 1 6h 1 Mpc (Fig. 4.7a). The inference is that the cluster 
has relaxed to a bound equilibrium configuration, as is confirmed by comparing the 
crossing time of a typical galaxy in the cluster with the age of the Universe. The 
crossing time is ^cr — R/(v) where R is the size of the cluster and (v) is the mean 
random velocity of galaxies. For the Coma cluster, taking {v} = 10 3 kms 1 and 
R — 2 Mpc, the crossing time is about 2 x 10 9 years, roughly a tenth the age of the 
Universe and so the cluster must be gravitationally bound. 

These data were further analysed in detail by Merritt who considered a wide 
range of possible models for the mass distribution within the cluster (Merritt, 1987). 
In the simplest reference model, with which the others can be compared, it is assumed 
that the mass distribution in the cluster follows the galaxy distribution, that is, the 
mass-to-luminosity ratio is a constant throughout the cluster, and that the velocity 
distribution is isotropic at each point in the cluster. With these assumptions, Merritt 
derived a mass for the Coma cluster of 1.79 x 10 l5 /f 1 Af 0 , assuming that the cluster 
extends to 16/z 1 Mpc. The mass within a radius of 1 h~ l Mpc of the cluster centre 
is 6.1 x 10 14 Mq. The corresponding value of the mass-to-blue luminosity ratio for 
the central regions of the Coma cluster is about 350 h M q /Lq. The population of 
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Fig. 4.7. a The surface density profile for the distribution of galaxies in the Coma cluster 
according to Kent and Gunn, b The projected velocity dispersion as a function of radius for 
galaxies in the Coma cluster (Kent and Gunn, 1982) 



galaxies in the central region of the Coma cluster is dominated by elliptical and SO 
galaxies for which the typical mass-to-luminosity ratios are about 10 — 20 M Q /L Q . 
There is therefore a discrepancy of about a factor of 20 between the mass which can 
be attributed to galaxies and the total mass which must be present. This is the result 
which was discovered by Zwicky (Zwicky, 1937). 

This is also where the trouble begins. Dark matter dominates the mass of the 
cluster but there is no reason why it should have the same distribution as the 
visible matter. Likewise, there is no reason a priori why the velocity distribu- 
tion of the galaxies should be isotropic. This is most simply expressed in terms 
of the velocity dispersion of the galaxies in the radial and circumferential direc- 
tions, er, 2 (r), <r,;(r) and cr^(r) within the cluster. The assumption of isotropy is that 
er 2 (r) = cr|(r) = cr^fr). If, however, the galaxies were on circular orbits about the 
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c log R/R c 

Fig. 4.7. (continued) c Three possible velocity dispersion profiles which are consistent with 
the data of Gunn and Kent, A corresponding to an isotropic velocity distribution, while B and 
C involve velocity anisotropies (Merritt, 1987) 



centre, a ~{r) = 0, or, if they were purely radial, crg(r) = cr^(r) = 0. Kent and 
Gunn pointed out that the extreme radial models are inconsistent with the variation 
of velocity dispersion with radius shown in Fig. 4.7. Merritt carried out a care- 
ful study of how the inferred mass-to-luminosity ratio would change for a wide 
range of different assumptions about the relative distributions of the visible and 
dark matter and the anisotropy of the velocity distribution (Fig. 4.7c) (Merritt, 
1987). For the cluster as a whole, the mass-to-luminosity ratio varied from about 
0.4 to at least three times the reference value, while the mass-to-luminosity ratio 
within the core of the cluster at 1 h~ [ Mpc was always very close to 350 IiMq/Lq. 
There can be no doubt that the dynamics of the cluster are dominated by dark 
matter. 

Perhaps the most remarkable result of recent times has been the finding that the 
Coma cluster is probably not the quiescent regular cluster it appears to be. Colless 
and Dunn have added 243 more radial velocities to the sample, bringing the total 
number of cluster members with radial velocities to 450 (Colless and Dunn, 1996). 
They find compelling evidence that, in addition to the main body of the cluster, there 
is a distinct subcluster whose brightest member is NGC 4839. The main cluster has 
mass 0.9 x 1 0 1 5 /z " 1 Mq, while the less massive cluster has mass 0.6 x 1 0 1 4 /z 1 M 0 . 
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Fig. 4.8. An X-ray image of the Coma cluster of galaxies obtained by the XMM-Newton 
Observatory, showing the X-ray emission associated with the main body of the Coma cluster 
and the smaller cluster associated with NGC 4839 (Courtesy of the Max Planck Institute for 
Extraterrestrial Physics and ESA) 



These clusters are clearly seen in the XMM-Newton X-ray image of the Coma cluster 
and the total masses derived from the X-ray observations agree with those derived 
by Colless and Dunn (Fig. 4.8). In addition, Colless and Dunn inferred that there is 
subclustering of galaxies about the two brightest members NGC4889 and NGC4874 
and that these are in the process of coalescing in the central regions of the cluster. 
The moral of this story is that appearances can be deceptive. 



4.4 X-Ray Observations of Hot Gas in Clusters of Galaxies 

The X-ray image of the Coma cluster (Fig. 4.8) provides an excellent demonstration 
of the power of X-ray astronomy in the study of clusters of galaxies. One of the most 
important discoveries made by the UHURU X-ray Observatory was the detection of 
intense X-ray emission from rich clusters of galaxies. The nature of the emission was 
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soon identified as the bremsstrahlung of hot intracluster gas, the key observations 
being the extended nature of the emission and the subsequent detection of the highly 
ionised iron line FeXXVI by the Ariel-V satellite (Mitchell et al., 1976). It was 
quickly appreciated that the X-ray emission of the gas provides a very powerful 
probe of the gravitational potential within the cluster enabling the distribution of hot 
gas and the total gravitating mass to be determined. Let us repeat the calculation 
presented by Fabricant, Lecar and Gorenstein which shows how this can be done 
(Fabricant et al., 1980). 

For simplicity, we assume that the cluster is spherically symmetric so that the 
total gravitating mass within radius r is M(< r). The gas is assumed to be in 
hydrostatic equilibrium within the gravitational potential defined by the total mass 
distribution in the cluster, that is, by the sum of the visible and dark matter as well 
as the gaseous mass. If p is the pressure of the gas and q its density, both of which 
vary with position within the cluster, the requirement of hydrostatic equilibrium 
is 



dp GM{< r)g 

dr r 2 



(4.19) 



The pressure is related to the local gas density q and temperature T by the perfect 
gas law 



gkT 

P = 

pm H 



(4.20) 



where mu is the mass of the hydrogen atom and /i is the mean molecular weight of 
the gas. For a fully ionised gas with the standard cosmic abundance of the elements, 
a suitable value is /x = 0.6. Differentiating (4.20) with respect to r and substituting 
into (4.19), we find 



gkT /Id g 1 dT\ GM{< r)g 

gunu \Q dr T dr ) r 2 

Reorganising (4.21), 

d(logg) d(log 7) 
dr dr 



M(< r) = - 



kTr 2 

Gpunu 



(4.21) 



(4.22) 



Thus, the mass distribution within the cluster can be determined if the variation 
of the gas density and temperature with radius are known. Assuming the cluster is 
spherically symmetric, these can be derived from high sensitivity X-ray intensity 
and spectral observations. A suitable form for the bremsstrahlung spectral emissivity 
of a plasma is 
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where N e and N are the number densities of electrons and nuclei respectively, Z is 
the charge of the nuclei and g(v, T) is the Gaunt factor, which can be approximated 
by 



The spectrum of thermal bremsstrahlung is roughly flat up to X-ray energies 
s — hv ~ kT, above which it cuts off exponentially (Longair, 1997b). Thus, by 
making precise spectral measurements, it is possible to determine the temperature 
of the gas from the location of the spectral cut-off and the column density of the hot 
gas from the X-ray surface brightness. In practice, the spectral emissivity has to be 
integrated along the line of sight through the cluster. Performing this integration and 
converting it into an intensity, the observed surface brightness at projected radius a 
from the cluster centre is 



Cavaliere noted that this is an Abel integral which can be inverted to find the emis- 
sivity of the gas as a function of radius (Cavaliere, 1980) 



X-rays maps of about 200 clusters were made by the Einstein X-ray Observa- 
tory (Forman and Jones, 1982) and more recently analyses of the ROSAT All Sky 
Survey have resulted in the NORAS and REFLEX catalogues of clusters of galaxies 
identified solely through the requirement that the extragalactic X-ray sources have 
extended structures (Bohringer et al., 2000, 2001). In the northern NORAS sample, 
it was found that 76% of the extended X-ray sources were indeed associated with 
clusters of galaxies. In addition, the ROSAT, Chandra and XMM-Newton X-ray 
Observatories have produced beautiful X-ray maps and spectroscopic studies of the 
hot gas in clusters. 

An example of the quality of data now available is illustrated by the X-ray 
map of the central regions of the Virgo cluster as observed in the ROSAT All Sky 
Survey (Fig. 4.9a). A number of galaxies belonging to the Virgo cluster have been 
detected as X-ray sources, as well as a few background clusters and active galaxies. 
In addition, the X-ray emission of the diffuse intergalactic gas is roughly centred 
on the massive galaxy M87. Evidence that the intergalactic gas traces the mass 
distribution of the cluster is provided by comparison of the contours of the X-ray 
surface brightness distribution with the surface distribution of galaxies as determined 
by the photometric survey of the Virgo cluster by Binggeli, Tammann and Sandage 
(Binggeli et al., 1987; Schindler et al., 1999). The distribution of galaxies in the 
cluster and the diffuse X-ray emission are remarkably similar (Fig. 4.9b). In both 
wavebands the irregular structure of the Virgo cluster can be decomposed into three 
major subclusters centred on M87, M49, and M86. In the M87 subcluster the gas 
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Fig. 4.9a, b. Comparison of the X-ray surface brightness distribution and the surface density 
of luminous matter in galaxies in the Virgo cluster, a The X-ray image of the Virgo cluster 
from the ROSAT All Sky Survey in the X-ray energy band 0.4 — 2 keV. The image has been 
smoothed with a Gaussian filter with o = 24 arcmin at the faintest levels and with decreasing 
filter size with increasing surface brightness. Some galaxies in the Virgo cluster have been 
detected as well as a few distant clusters and active galaxies. (Courtesy of the Max Planck 
Institute for Extraterrestrial Physics, Garching.) b The contours show the number density of 
the 1292 member galaxies in the Virgo Cluster Catalogue smoothed with a Gaussian filter 
with 0 — 24 arcmin. The spacing of the contours is linear (Schindler et ah, 1999) 

mass is about three times the visible mass in galaxies, assuming M/L = 20 for 
the galaxies), while it accounts only for 8% to 14% of the total mass at 0.4 and 
1 Mpc, respectively. The projected mass-to-light ratio has a roughly constant value 
of about 500 M 0 /L 0 . Thus, the dark matter, which defines the gravitational potential 
in the cluster and which is traced by the distribution of hot gas, must have a similar 
distribution to that of the visible matter. 

Another beautiful example of the combined use of X-ray imaging and spec- 
troscopy is provided by the observations and analysis of the rich cluster Abell 1413 
by Pratt and Arnaud (Pratt and Arnaud, 2002). The observations by the XMM- 
Newton X-ray Observatory included spatially resolved X-ray spectroscopy of the 
cluster X-ray emission and so the projected temperature variation with radius in the 
cluster could be determined. Their results are shown in Fig. 4.10. First, the average 
X-ray surface brightness distribution as a function of radius is fitted by an empirical 
model (Fig. 4. 10a). Then, the projected average temperature of the gas is estimated in 
annuli at different radial distances from the centre of the cluster (Fig. 4.10b). These 
are deprojected to derive the variation of the total mass within radius r using (4.22) 
(Fig. 4. 10c). Finally, the ratio of gas density to total density as a function of radius, or 
in the case of Fig. 4.10d, the overdensity relative to the critical cosmological density, 
can be found. 

These data are typical of what is found in rich clusters of galaxies. The dominant 
form of mass is the dark matter the nature of which is unknown. About 20% of the 
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Fig. 4.10a-d. Illustrating the determination of the physical properties of the cluster A1413 
from X-ray imaging and spectroscopy by the XMM-Newton X-ray Observatory, a The X-ray 
brightness distribution as a function of distance from the centre of the cluster, b The projected 
radial distribution of temperature of the gas. c The integrated mass distribution as a function 
of distance form the centre, d The fraction of gas density to total mass density / gas within the 
cluster as a function of overdensity S relative to the critical cosmological density (Pratt and 
Amaud, 2002) 



mass is in the form of hot intergalactic and this is typically about five times the mass 
in the visible parts of galaxies. The spectroscopic observations also enable the mass 
of iron in the intracluster medium to be determined and this is typically found to be 
between about 20 and 50% of the solar value, indicating that the intergalactic gas 
has been enriched by the products of stellar nucleosynthesis. 

The wealth of data from the recent generation of X-ray satellites has enabled 
important insights into their origin and evolution of clusters to be obtained. It is con- 
venient to adopt a simple reference model for understanding the various correlations 
between the properties of clusters, what is referred to as the self-similar model for 
cluster formation; this will be discussed in more detail in Chap. 16. 
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The simplest picture of cluster formation is to suppose that clusters developed 
from small perturbations in the distribution of the dark matter by gravitational col- 
lapse (see Part II). The amplitude of the perturbations grew from values Sq/q ~ 1 0 3 
at the epoch of recombination to large amplitudes Sq/q ^ 1 at late epochs. The 
considerations of Sect. 16.1 suggest that clusters became virialised bound systems 
when (Sq/q) v ~ 200. In the standard self-similar model, it is supposed that clusters 
of all masses virialised at the same epoch with the same values of (Sq/q) v . As a re- 
sult, their masses are simply proportional to q w R 3 , where q v was the mean density of 
the Universe at the epoch of virialisation. When the clusters virialised, they satisfied 
the virial theorem, GM 2 /R = Mo 2 . Since M oc R 3 , it follows that er v oc R, where 
R can be taken to be the characteristic radius of the cluster. 

The bolometric X-ray luminosity of the intracluster gas due to thermal brems- 
strahlung is given by the relation 

L x oc VN 2 T 1/2 , (4.27) 



where V ~ R 3 is the volume of emitting gas, N e is the electron density and T 
the temperature of the gas (Longair, 1997b). The gas forms an atmosphere within 
the gravitational potential of the cluster as a whole, the latter being defined by the 
distribution of the dark matter. Equation (4.19) can be written to order of magnitude 
as 
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(4.28) 



where it is assumed that the ions and electrons both contribute to the pressure of the 
gas. According to the virial theorem (3.19), the velocity dispersion of the galaxies 
is related to the gravitational potential by GM/R ~ o 2 and so 

kT ~ m p o 2 , (4.29) 



that is, T oc a 2 . If the fraction of the total mass of the cluster in hot intracluster gas 
is r /, it follows that the X-ray luminosity of the cluster is 

L x oc oc Ro;T l/2 oc o v 4 , (4.30) 

where it has been assumed that i] takes the same value for clusters of different 
masses. Then, since M oc R 3 oc a 3 , this model results in the prediction that the 
X-ray luminosity and the mass of the cluster are related by 

L x oc M 4/3 . (4.31) 



The scaling relations (4.30) and (4.31) have been the subject of considerable 
study. Figure 4.11 shows the results of a major study of X-ray clusters discovered in 
the ROSAT All Sky Survey (Reiprich and Bohringer, 2002). It can be seen that the 
best-fit relation is slightly steeper than the predictions of the scaling model, but it is 
interesting that the general trend is not so different from the simple picture. 




Fig. 4.11. The correlation between the X-ray luminosity of a cluster of galaxies and its total 
mass for the brightest X-ray clusters of galaxies in the ROSAT All Sky Survey (Reiprich 
and Bohringer, 2002). The mass was determined to the virial radius which was taken to be 
a density 200 greater than the critical cosmological density. The best-fit relation for their 
sample of 106 clusters is shown by the solid line. The best-fit relation for a smaller sample of 
63 clusters which form a better complete sample is shown by the tripledot-dashed line. The 
dot-dashed line shows the expectation of the simple self-similar relation L x oc M 4 / 3 . The 
dashed line is the prediction of a model with preheating of the intracluster gas 



Another test of this model is the X-ray luminosity-velocity dispersion relation 
which has been analysed in some detail by Ortiz-Gil and her colleagues (Ortiz-Gil 
et al., 2004). For a large sample of clusters from the REFLEX catalogue derived from 
the ROSAT All Sky Survey, they found a correlation of the form L x oc cr^ 41±0 3 \ 
in good agreement with the correlation (4.30) predicted by the self-similar model 
(Fig. 4. 1 2). A similar result was found for a smaller sample of clusters which formed 
a more complete statistical sample. Although this result is encouraging, their results 
for the L x — T and o v — T relations differ from the expectations of the self-similar 
model. They found L x oc 7 ’ 31±0 2 compared with the expected relation L x a T 2 and 
cry oc 7 ’ 100±016 compared with the predicted cr v oc T 0 5 . They noted that these last 
two relations are compatible with the X-ray luminosity-velocity dispersion relation. 
They inferred that there must be other contributions to the thermal heating and 
cooling of the intracluster gas than simply the thermal energy which resulted from 
the initial process of virialisation. There is now considerable evidence that such 
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Fig. 4.12. A plot of bolometric X-ray luminosity L x versus radial velocity dispersion a v for 
the REFLEX sample of 171 clusters. The linear fit has logarithmic slope of (4.1 ± 0.3). The 
dashed lines are the lcr errors. Filled circles correspond to clusters at redshift z < 0.05, open 
triangles are clusters with 0.05 < z S 0. 1 and open circles are clusters at z > 0. 1 (Ortiz-Gil 
et al„ 2004) 



additional sources of heating and cooling are important in understanding the state of 
the intracluster gas. 

If the density of the hot intracluster gas is large enough, its cooling rate can 
be sufficiently great for it to cool over cosmological time-scales. At high enough 
temperatures, the principal energy loss mechanism for the gas is the same thermal 
bremsstrahlung process which is responsible for the X-ray emission. The total energy 
loss rate per unit volume due to thermal bremsstrahlung is 

J = 1.435 x lQ- 40 Z 2 TigNN e Wm~ 3 , (4.32) 

where Z is the charge of the ions, N and N e are the number densities of ions and 
electrons respectively and g is a mean Gaunt factor which has value roughly 1 - we 
assume Z = 1 and N — N e . The thermal energy density of the fully ionised plasma 
is e = 3NkT and so the characteristic cooling time for the gas is 
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where the temperature is measured in kelvins and the number density of ions or 
electrons is measured in particles m 3 . Thus, if the typical temperature of the gas 
is 10 7 to 10 8 K, the cooling time is less that 10 10 years if the electron density is 
greater than about 3 x 10 3 to 10 4 m 3 . These conditions are indeed found in many 
of the clusters of galaxies which are intense X-ray emitters. As a result, the central 
regions of these hot gas clouds cool and, to preserve pressure balance, the gas density 
increases, resulting in the formation of a cooling flow. 

Evidence for these cooling flows and their role in the evolution of clusters 
of galaxies was reviewed by Fabian (Fabian, 1994). An example of the cooling 
flow in the cluster Abell 478 is illustrated by the diagrams shown in Fig. 4.13. 
The ROSAT observations have been deprojected to determine mean values of the 
density and temperature of the gas as a function of radial distance from the centre. 
The temperature decreases towards the central regions while the electron density 
increases to values greater than 10 4 m 3 in the very centre. At a radius of 200 kpc, 
the electron temperature is T — lx 10 7 K and the electron density N e = 8 x 10 3 m 3 . 
Inserting these values into (4.33), we find that the cooling time is 10 10 years. It can 
be seen that, outside this radius, the temperature of the gas is constant, whilst at 
smaller radii the gas temperature decreases towards the central regions. 

As a result, matter drifts slowly in through the surface at radius r coo | , at which 
the cooling time of the gas is equal to the age of the cluster. The X-ray luminosity of 




Fig. 4.13. The properties of the intracluster gas in the cluster Abell 478 obtained by depro- 
jecting images taken by the ROSAT X-ray Observatory (White et al., 1994). The cooling time 
of the gas is less than 10 10 years within a radius of 200 kpc (Fabian, 1994) 
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the cooling flow results from the internal energy of each element of the gas as well 
as the work done as it drifts slowly in towards the central regions whilst maintaining 
hydrostatic equilibrium. The appropriate conserved quantity is 

qv 2 (^v 2 + w) , (4.34) 

where q, v and w are the density, velocity and enthalpy of the gas flow. Note that 
the relevant quantity which describes the energy content of the flow is the enthalpy 
w — e + pV, where e is the internal energy of the gas and p and V are its pressure 
and specific volume. This is because, in addition to its internal thermal energy, 
work is done on the volume element by the pressure of the gas. When the volume 
element drifts within the cooling radius r coo i, all of the energy associated with the 
enthalpy is available for heating the gas and so the energy deposited per unit mass 
is w = € + pV = | nkT for a perfect gas, where n is the number density of particles 
in the flow. It follows that the total energy input to the gas is determined by the rate 
at which mass drifts inwards through any radius, N. In the steady state, this energy 
is reradiated by the gas as X-rays and so the cooling luminosity of the flow L coo i is 

Tcooi = \NkT= \—kT, (4.35) 

p,m 

where m is the mass of the hydrogen atom and /x is the mean molecular weight of the 
gas. Thus, knowing the total X-ray luminosity due to cooling, and the temperature of 
the gas, (4.35) can be used to work out the mass flow rate M through any radius. The 
result of this calculation for the cluster Abell 478 is shown in Fig. 4.13. The cooling 
flow results in a mass inflow rate of about 600 to 800 M Q y 1 and so over a period 
of 10 10 years, such cooling flows can contribute significantly to the baryonic mass 
in the central regions of the cluster. 

According to Fabian, about half of the clusters detected by the Einstein X-ray 
Observatory have high central X-ray surface brightnesses and cooling times less 
than 10 10 years (Fabian, 1994). Abell 478 has a particularly massive flow. Typically, 
the inferred mass flow rates are about 100 to 300 Mq y" 1 . 

This cannot be the whole story, however, since X-ray spectroscopic observations 
of the cores of clusters have shown that there is an absence of cool gas which would 
be expected if there were no other energy sources. This is most vividly demonstrated 
by observations by the ESA XMM-Newton satellite, for example, the observations 
of the cluster Sersic 159-03 which has a cool core (de Plaa et al., 2005). Figure 4.14 
shows the X-ray spectrum of the cluster, the solid line showing the wealth of X-ray 
emission lines expected according to standard models of cooling flows. It can be 
seen that the observed spectrum differs dramatically from the expectations of the 
cooling flow models, because of the absence of strong lines associated with ions 
such as FeXVII. This lack of cool gas seems to be a feature of many of the cooling 
flows observed in rich clusters of galaxies (Kaastra et al., 2004). The inference is that 
there must be some further heating source within the cluster to reheat the cooling 
gas. 

Many models have been proposed to resolve this problem, some of these being 
listed by Kaastra and his colleagues (Kaastra et al., 2004). These models include 
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Fig. 4.14. Comparison of the observed high resolution X-ray spectrum of the cluster of galaxies 
Sersic 159-03 observed by the ESA XMM-Newton satellite with the predicted spectrum of 
a standard cooling flow model without heating. The strong lower excitation lines from ions 
such as Fe XVII are absent, indicating the lack of cool gas in the cluster (de Plaa et al., 2005) 



metallicity inhomogeneities, buoyantly rising radio bubbles transporting cool gas 
outwards, halo-in-halo structures, turbulent mixing due to rising and falling hot gas 
bubbles, heating by AGN activity, contamination due to non-thermal X-ray emission, 
heating by dead radio galaxies, rapid cooling due to mixing with cold gas and heat 
conduction by electrons. They also propose heating by coronal loops, similar to 
those observed in the corona of the Sun, but on the scale of the cores of clusters of 
galaxies. 

A highly suggestive set of observations made by the Chandra X-ray Observatory 
shows that the cooling gas in the central regions of a number of clusters is perturbed 
by the presence of radio lobes of recent radio source events. One of the most 
compelling examples is the central region of the Perseus cluster of galaxies in 
which the buoyant lobes of relativistic plasma have pushed back the intracluster gas, 
forming ‘holes’ in the X-ray brightness distribution (Fig. 4. 15a) (Fabian et al., 2000). 
In a very long X-ray exposure with the Chandra X-ray Observatory, Fabian and his 
colleagues identify what they interpret as isothermal sound waves produced by the 
weak shock wave associated with the expanding lobes (Fig. 4.15b). They show that 
the energy injected into the intracluster gas by these sound waves can balance the 
radiative cooling of the cooling flow (Fabian et al., 2006). 
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Fig. 4.15a, b. The central regions of the Perseus cluster of galaxies as observed by the Chandra 
X-ray Observatory, a The central regions of the cluster showing the cavities evacuated by the 
radio lobes which are shown by the white contour lines (Fabian et al., 2000). b An unsharp- 
mask image of the central regions of the cluster showing the various features caused by the 
expanding radio lobes. Many of the features are interpreted as sound waves caused by the 
weak shock wave associated with the expansion of the radio lobes (Fabian et al., 2006) 



4.5 The Sunyaev-Zeldovich Effect in Hot Intracluster Gas 



An independent method of studying hot gas in clusters of galaxies and elsewhere 
in the Universe is through the search for decrements in the intensity of the Cosmic 
Microwave Background Radiation. As the photons of the background radiation pass 
through the gas cloud, a few of them suffer Compton scattering by the hot electrons. 
Although to first order the photons are just as likely to gain as lose energy in 
these scatterings, to second order there is a net statistical gain of energy and so 
the spectrum of the Cosmic Microwave Background Radiation is shifted to slightly 
higher energies. As a result, there is expected to be a decrease in the intensity of 
the background radiation in the Rayleigh-Jeans region of the spectrum, that is, at 
energies hv kT r , while in the Wien region, /iv » kT r , there should be a slight 
excess - T r is the temperature of the background radiation. These predictions were 
made by Sunyaev and Zeldovich as long ago as 1969 (Sunyaev and Zeldovich, 1970) 
and it was almost 20 years before the Sunyaev-Zeldovich effect was observed with 
confidence in the directions of clusters of galaxies (Birkinshaw, 1990). 

The magnitude of the distortion is determined by the Compton scattering optical 
depth y through the region of hot gas, 

y = J (J^jcJ T N e dl. (4.36) 

The resulting decrement in the Rayleigh-Jeans region of the spectrum is 
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(4.37) 



